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Abstract  of  Dissertation  Presented  to  the  Graduate  School 
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Chairman:  Dr.  Sartaj  Sahni 

Major  Department:  Computer  and  Information  Science  and  Engineering 

In  this  dissertation,  we  develop  efficient  algorithms  and  data  structures  for  prob- 
lems that  arise  in  electronic  computer  aided  design  (ECAD). 

We  consider  the  problem  of  joining  a  row  of  compacted  cells  so  as  to  minimize  the 
area  occupied  by  the  cells  and  the  interconnects.  The  cell  joining  process  includes 
cell  stretching  and  river  routing.  We  propose  several  heuristics  to  join  a  row  of 
cells  in  such  a  way  that  area  is  minimized.  The  proposed  heuristics  are  compared 
experimentally  with  the  previously  proposed  heuristic. 

We  develop  a  new  class  of  weight  balanced  binary  search  trees  called  /^-balanced 
binary  search  trees  (yS-BBSTs).  /?-BBSTs  are  designed  to  have  reduced  internal  path 
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length.  As  a  result,  they  are  expected  to  exhibit  good  search  time  characteristics. 
Individual  search,  insert,  and  delete  operations  in  an  n  node  /?-BBST  take  O(logn) 
lime  for  0  <  /3  <  v/2  -  1.  Experimental  results  comparing  the  performance  of 
BBSTs,  WB(a)  trees,  AVL-trees,  red/black  trees,  treaps,  deterministic  skip  lists  and 
skip  lists  are  presented.  Two  simplified  versions  of  /?-BBSTs  are  also  developed. 

We  propose  the  weight  biased  leftist  tree  as  an  alternative  to  traditional  leftist 
trees  for  the  representation  of  mergeable  priority  queues.  A  modified  version  of 
skip  lists  that  uses  fixed  size  nodes  is  also  proposed.  Experimental  results  show 
our  modified  skip  list  structure  is  faster  than  the  original  skip  list  structure  for  the 
representation  of  dictionaries.  Experimental  results  comparing  weight  biased  leftist 
trees  and  competing  priority  queue  structures  as  well  as  experimental  results  for 
double  ended  priority  queues  are  presented. 


CHAPTER  1 
INTRODUCTION 

1.1  Background 

In  VLSI  layout,  we  are  concerned  with  transforming  a  circuit  from  its  logical  de- 
sign to  a  physical  implementation.  The  layout  problem  for  VLSI  circuits  is  generally 
decomposed  into  smaller  problems  such  as  partitioning,  floorplanning,  placement, 
routing  and  compaction. 

The  pcirtitioning  process  decomposes  a  large  circuit/module  into  a  collection  of 
smaller  sub-circuits/modules.  In  floorplanning,  logical  components  of  a  circuit  are 
assigned  relative  positions  on  a  chip.  The  physical  realization  of  each  component  (i.e., 
its  area  and  aspect  ratio)  is  also  selected.  The  objectives  of  floorplanning  include 
overall  area  minimization,  minimization  of  power  consumption,  etc.  The  precise 
locations  for  the  components  of  a  design  are  then  determined  during  the  placement 
process  to  optimize  the  area  and  the  timing.  After  the  components  are  placed,  the 
pins  are  connected  during  the  routing  process.  During  the  process  of  compaction, 
the  components  and  interconnections  are  moved  so  as  to  further  optimize  the  layout 
in  terms  of  area  and  delay. 

The  routing  process  is  usually  divided  into  three  smaller  sub-problems  of  global 
routing,  detailed  routing,  and  specialized  routing.   Global  routing  decomposes  the 
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complex  routing  problem  into  small  and  manageable  sub-problems.  It  assigns  each 
net  to  a  set  of  routing  regions  such  as  channel,  switchbox,  and  planar  routing  to 
minimize  a  combination  of  criteria  such  as  area,  circuit  delay,  etc.  Steiner  trees 
and  spanning  trees  are  the  commonly  used  approaches  for  net  connection  in  global 
routing.  Specialized  routing  is  used  to  connect  power-ground  nets  or  clock  nets. 
Detailed  routing  has  two  types  of  routing,  general  and  restricted  and  there  are  three 
types  of  detailed  restricted  routing,  channel,  switchbox,  and  planar. 

In  channel  routing,  all  terminals  of  nets  are  located  in  two  parallel  rows  across 
a  routing  region  called  channel.  In  switchbox  routing,  terminals  of  nets  are  located 
on  the  four  sides  of  the  routing  region.  Planar  routing  is  a  problem  in  which  the 
interconnection  topology  of  the  nets  is  planar.  That  is,  all  connections  can  be  realized 
on  a  single  layer.  Vias  allow  wires  to  change  layers  but  the  presence  of  vias  reduces 
reliability  and  performance  of  a  circuit.  Single  layer  routing  is  not  always  possible. 

River  routing  is  a  special  case  of  planar  routing  in  which  all  nets  have  exactly 
two  terminals,  one  on  each  side  of  the  channel,  and  the  net  sequence  on  each  side 
of  the  channel  is  the  same.  River  routing  is  used  in  PCB  routing,  particularly  in 
dataflow  architectures  with  multi-bit  buses  connecting  a  series  of  logic  blocks,  and 
symbolic  IC  design  systems. 

1.2    Dissertation  Outline 

This  dissertation  is  divided  into  four  chapters.  In  Chapter  2,  we  consider  the 
problem  of  joining  a  row  of  compacted  cells  so  as  to  minimize  the  area  occupied  by 
the  cells  and  the  interconnects.  The  cell  joining  process  includes  cell  stretching  and 
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river  routing.  We  propose  several  heuristics  to  join  a  row  of  cells  in  such  a  way  that 
area  is  minimized.  The  proposed  heuristics  are  compared  experimentally  with  the 
previously  proposed  one. 

VLSI  Physical  Design  Automation  is  essentially  the  study  of  algorithms  and  data 
structures  related  to  the  physical  design  process.  Specific  data  structures  can  be  used 
to  improve  the  performance  of  algorithms.  For  example,  maze  routing  algorithms  and 
line-probe  algorithms  in  global  routing  [30]  use  search  structures,  retiming  algorithms 
[6,  29]  use  priority  queue  structures,  and  layout  compaction  algorithms  [13]  use  radix 
priority  search  trees. 

Chapters  3  and  4  are  about  search  and  priority  queue  structures.  As  specific 
data  structures  could  be  used  to  produce  better  performance  of  algorithms  in  VLSI 
design  automation,  new  search  and  priority  queue  structures  are  proposed  and  thor- 
oughly compared  with  other  data  structures. 

FincJly,  in  the  last  chapter,  we  present  conclusions  of  this  work. 


CHAPTER  2 

MINIMUM  AREA  JOINING  OF  COMPACTED  CELLS 
2.1  Introduction 

When  designing  circuits  with  compacted  symbolic  sticks  basic  cells,  the  circuit 
is  realized  by  a  collection  of  compacted  cells  that  tile  a  two-dimensional  area.  The 
intercell  interconnects  are  such  that  each  interconnect  connects  two  terminals  that 
are  on  adjacent  boundaries  of  neighboring  cells.  So,  for  example,  if  cells  A  and  B 
(Figure  2.1(a))  are  neighboring  cells  of  the  circuit,  then  the  right  boundary  of  A 
is  adjacent  to  the  left  boundary  of  B.  The  number  of  terminals  on  each  of  these 
boundaries  will  be  the  same  and  the  t'th  terminal  (from  the  bottom)  on  the  right 
boundary  of  A  is  to  be  connected  to  the  t'th  terminal  (from  the  bottom)  on  the  left 
boundary  of  B. 

Since  the  cells  are  available  in  compacted  form,  it  is  not  possible  to  reduce  the 
distance  between  any  pair  of  terminals  on  any  side  of  a  cell.  However,  this  distance 
can  be  increased  by  stretching  the  cell.  In  the  example  of  Figure  2.1(a),  we  can  stretch 
either  cell  vertically  by  defining  a  horizontal  cut  line  at  any  position  and  pulling  the 
two  cell  pieces  apart  by  any  desired  amount  (the  cell  can  also  be  stretched  horizontally 
by  using  a  vertical  cut  line). 
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(a)  Horizontal  adjacent  cells 


(b)  Joining  by  stretching 


(c)  Joining  by  river  routing  (d)  Combination  cell  joining 

Figure  2.1.  Cell  joining 


The  required  interconnects  between  cells  A  and  B  of  Figure  2.1(a)  can  be  ac- 
complished by  stretching  cells  A  and  B  so  that  the  terminals  of  A  and  B  line  up  as 
in  Figure  2.1(b).  The  broken  lines  in  Figure  2.1(a)  indicate  the  cut  lines  used  for 
stretching.  The  stretching  enables  us  to  join  cells  A  and  B  using  no  routing  tracks 
(by  "join"  we  mean  make  the  interconnects  between  cells  A  and  B).  This  method  of 
joining  cells  is  also  called  pitch  matching. 

Another  way  to  join  cells  A  and  B  is  to  river  route  the  interconnects  as  in 
Figure  2.1(c).  This  uses  routing  tracks  in  a  channel  between  cells  A  and  B  but  does 
not  increase  cell  height.  The  pitch  matching  and  river  routing  approaches  to  cell 
joining  have  been  studied  in  Boyer  [5]  and  Weste  [33].  Algorithms  for  single-layer 
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river  routing  can  be  found  in  several  works  [15,  19,  23,  24]  and  those  for  multilayer 
river  routing  can  be  found  in  Baratz  [3].  Single-layer  gridless  river  routing  is  studied  in 
Tompa  [32].  Two  applications  of  river  routing  are  hybrid  circuit  design  and  structured 
design  (DSP). 

Cell  stretching  (or  pitch  matching)  increases  the  height  of  the  layout  while  river 
routing  increases  its  width.  Both  affect  the  layout  area.  The  layout  of  Figure  2.1(b) 
has  area  150.  To  compute  the  area  of  the  layout  of  Figure  2.1(c),  we  assume  tracks 
have  unit  separation.  So,  the  layout  width  is  14  and  height  is  11.  The  layout  has  area 
154.  Cheng  and  Despain  [8]  have  proposed  using  a  combination  of  cell  stretching  and 
river  routing  so  as  to  obtain  layouts  with  smaller  area  than  possible  when  only  one 
of  these  joining  methods  is  used.  Figure  2.1(d)  shows  the  result  of  joining  cells  A 
and  B  using  both  stretching  and  river  routing.  The  area  of  this  layout  is  144.  This 
is  minimum  for  the  instance  of  Figure  2.1(a). 

Cheng  and  Despain  [8]  have  proposed  a  heuristic  for  single  layer  joining  of 
compacted  cells.  At  each  step  of  their  heuristic  either  a  row  or  column  of  compacted 
cells  is  joined.  Following  this,  the  row  or  column  of  joined  cells  is  replaced  by  a 
composite  cell  that  represents  the  result  of  joining.  Notice  that  when  a  row  (column) 
of  cells  is  joined,  cells  may  be  stretched  vertically  (horizontally)  and  river  routing  is 
done  in  a  vertical  (horizontal)  channel.  To  join  a  row  of  cells,  Cheng  and  Despain  [8] 
bound  the  maximum  height  to  which  a  cell  may  be  stretched.  This  bound  is 

"'max  max  } 
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where  /i^ai  is  the  height  of  the  taJlest  compacted  cell  being  joined  and  havg  is  the 
average  height  of  the  cells  being  joined. 

Using  this  bound,  cells  are  joined  one-at-a-time  using  a  penalty/reward  scheme 
to  determine  if  a  pair  of  terminals  is  to  be  joined  by  stretching  or  by  river  routing. 

Lim,  Cheng,  and  Sahni  [17]  have  considered  the  case  when  only  two  cells  are  to 
be  joined.  They  develop  fast  polynomial  time  algorithms  to  obtain  the  minimum  area 
join  of  two  cells.  In  addition,  they  are  able  to  obtain,  in  low  order  polynomial  time, 
minimum  area  joins  that  minimize  the  length  of  the  longest  wire  or  the  total  wire 
length.  Lim  [16]  has  proposed  an  0{n{nfcY~^)  algorithm  to  find  the  minimum  area 
join  of  c  cells  having  a  total  of  n  terminals.  This  algorithm  does  an  exhaustive  search 
over  all  possible  numbers  of  tracks  in  the  c  -  1  routing  channels  between  adjacent 
cells.  A  constraint  graph  is  used  to  determine  the  minimum  height  layout  for  each 
assignment  of  number  of  trax:ks  to  routing  channels.  The  time  required  per  track 
assignment  is  0(n)  and  the  worst  case  number  of  track  assignments  is  0((n/c)''"*). 
The  algorithm  of  Lim  [16]  is  flawed  as  it  handles  channels  with  zero  routing  tracks 
by  joining  the  adjacent  cells  using  minimum  height  cell  stretching  and  then  considers 
the  joined  cells  as  one.  This  problem  is  easily  fixed,  however,  by  combining,  in  the 
constraint  graph,  pairs  of  vertices  that  represent  corresponding  terminals  of  the  two 
cells  (i.e.,  i'th  terminals  of  each  cell)  with  zero  routing  tracks  in  between. 

In  this  chapter,  we  consider  the  case  when  /  >  1  routing  layers  are  available  to 
river  route  the  inter  cell  connections.  Note  that  while  multiple  layers  do  not  affect 
layout  area  when  cell  stretching  aJone  is  used,  a  reduction  in  area  is  possible  when 
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(a)  Mayer  (b)  2-layer 

Figure  2.2.  /-layer  river  routing 

cell  stretching  is  combined  with  river  routing  or  when  river  routing  alone  is  used. 
We  assume  that  in  each  layer  of  each  routing  channel,  the  interconnects  are  to  be 
accomplished  using  river  routing.  An  alternative  is  to  use  HV  routing  when  /  -  2, 
HVH  or  VHV  routing  when  /  =  3,  and  extensions  of  HVH  and  VHV  routing  for 
/  >  3.  However,  for  river  routing  instances,  using  routing  layers  in  this  way  has 
no  advantage  over  river  routing  in  each  layer  (see  Theorem  5,  Section  2).  When 
the  number  of  layers  available  for  river  routing  is  increased,  one  may  see  a  dramatic 
reduction  in  the  number  of  routing  tracks  needed  per  layer.  Figure  2.2  shows  an 
instance  that  needs  n  tracks  when  routed  in  one  layer  but  only  one  track/layer  when 
routed  in  two  layers. 

We  begin,  in  section  2,  by  stating  the  necessary  and  sufficient  conditions  for 
a  river  routing  instance  to  be  routable  in  /  layers  using  at  most  t  tracks  per  layer 
and  stating  how  to  perform  Mayer  river  routing  when  such  a  routing  is  possible.  In 
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this  section,  we  also  show  that  HV  style  routing  has  no  advantage  over  river  routing 
in  each  layer.  In  Section  2.3,  we  describe  the  constraint  graph  used  to  determine 
minimum  height  stretching  of  c  cells.  Heuristics  for  the  minimum  area  joining  of 
c  cells  are  proposed  in  Section  2.4  and  the  results  of  experiments  with  these  are 
provided  in  Section  2.5.  Our  conclusions  appear  in  Section  2.6. 

2.2    Mayer  River  Routing 

Let  {Ai,Bi),  1  <  j  <  m,  be  a  set  of  terminal  pairs  such  that  the  Ai's  are  on 
one  side  (say  left  or  top)  of  a  routing  channel  and  the  B.'s  are  on  the  other  (right 
or  bottom)  side.  Terminal  A,  is  to  be  connected  to  terminal  Bi,  I  <  i  <  m.  For 
this  channel  routing  instance  to  be  an  instamce  of  river  routing,  it  must  be  the  case 
that  oi  <  02  <  . . .  <  and  6i  <  62  <  •  •  •  <  ^'m  where  a.  and  6„  respectively,  give 
the  positions  of  terminals  A,  and  Bi,  I  <  i  <  m.  We  may  assume  an  underlying 
grid  with  each  terminal  being  at  a  grid  position.  In  the  case  of  a  horizontal  (vertical) 
channel  the  a,s  and  6,s  are  grid  column  (row)  numbers.  Leiserson  and  Pinter  [15]  have 
obtained  the  following  necessary  and  sufficient  condition  for  a  river  routing  instance 
to  be  routable  in  a  single  layer  using  at  most  t  >  0  tracks. 

Theorem  1  [15]  The  river  routing  instance  defined  above  is  routable  in  a  single  layer 
using  at  most  t  >  0  tracks  if  and  only  if 

(a)  Gi+t  -  b,>t 

(b)  b,+t  -  a,>t 
for  every  i  <  m  —  t. 
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For  the  general  case  of  /  >  1  layers,  we  obtain  the  necessary  and  sufficient 
condition  of  Theorem  2. 

Theorem  2  The  river  routing  instance  defined  above  is  routable  in  I  >  \  layers  (each 
layer  routing  whole  nets)  using  at  most  t  >  0  tracks  per  layer  if  and  only  if 

(a)  ai+it  -bi>t 

(b)  b,+it  -a,>t 
for  every  i  <  m  —  It. 

Proof:  First,  we  establish  that  (a)  and  (b)  are  necessary  for  /  layer  routing.  Since 
the  proofs  for  (a)  and  (b)  are  similar,  we  provide  that  for  (a)  only.  Suppose  that 
Oi+zt  -  bi  <  t  for  some  i.  Consider  the  It  +  \  terminal  pairs  {A,,B,\i<j<i^-lt. 
When  routing  these  on  /  layers,  at  lea^t  one  layer  has  to  be  assigned  >  i  +  1  terminal 
pairs.  So,  suppose  that  terminal  pairs  {A^.Bi^),        (^'t+i,  ^t'+i)'  •••  ^""^ 

assigned  to  the  same  layer  for  river  routin  g.  We  may  assume  that  a\  <  a'^  <  .  ■  ■  < 
a\^,  and  <  <  . . .  <  tj+i-  Since  a\^^  <  a.+„  and  b\  >  6.,  a[^,-b[  <  ai+n-k  <  t. 
From  Theorem  1,  it  follows  that  the  terminal  pairs  [A'j,  B'^),  \  <  j  <  t  +  1,  cannot  be 
river  routed  on  a  single  layer.  Hence,  {A,,  B,),  i<j<i  +  lt,  cannot  be  river  routed 
on  /  layers.  So,  [A^,  1  <  J  <  m,  cannot  be  river  routed  on  /  layers.  As  a  result, 
(a)  is  a  necessary  condition. 

To  show  that  (a)  and  (b)  are  sufficient  conditions  for  routability,  we  present  two 
algorithms  (RoundRobin  and  Greedy)  that  assign  the  nets  to  layers  in  such  a  way 
that  each  layer  is  river  routable  when  both  (a)  and  (b)  are  satisfied.  The  correctness 
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procedure  RoundRobin  ; 

{  Assign  the  m  nets  to  /  layers.  } 

begin 

for  i  :=  1  to  m  do 

assign  net  iA,,Bi)  to  layer  (i  mod  /)  +  1  ; 

end  ; 

procedure  Greedy  ; 

{  Assign  the  m  nets  to  /  layers.  } 

begin 

for  i  :=  1  to  m  do 

assign  net  (A,,  5.)  to  layer  q  such  that  q  is  the  smallest  integer  <  / 
for  which  the  conditions  of  Theorem  1  aire  not  violated  on  layer  q 
(if  there  is  no  such  q,  then  fail)  ; 

end  ; 

Figure  2.3.  Round  robin  and  greedy  layer  assignments 
of  these  algorithms  is  established  in  Theorems  3  and  4,  respectively. 


We  later  discovered  that  Baratz  [3]  has  not  only  obtained  the  same  condition 
but  also  proposed  the  same  two  algorithms  for  /-layer  river  routing.  One  assigns 
nets  to  layers  in  a  round  robin  fashion  and  the  other  uses  a  greedy  strategy.  The 
corresponding  procedures  are  given  in  Figure  2.3. 

Theorem  3  The  layer  assignment  produced  by  the  RoundRobin  procedure  is  river 
routable  if 

(a)  ai+u  -bi>t 

(b)  bi+it  -a,>i 
for  all  i  <  m  —  It. 
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Proof:  Let  {A[,B[),  {A'„  B'^),  ...  =  {A,,  B,),  5,+/),  (^^+2/,  B,+2,),  ■  •  •  be  the 

nets  assigned  to  layer  [j  mod  /)  +  1,  ;  <  /•  So,  a\  =  aj+(,-i)/  and  6-  = 
Hence, 

a'+i  -     =  aj+(i+t-i)/  - 
>  i  (from  (a)) 

Similarly,  6-+^  -  a-  >  So,  the  layer  assignment  satisfies  the  conditions  of  Theorem  1 
and  is  river  routable  using  t  tracks.  ^ 

Theorem  i  If  (a)  a,+,t  -  6.  >  <  and  (b)  bi+n  -  a.  >  <  for  all  i  <m-  It,  then  procedure 
Greedy  assigns  nets  to  layers  such  that  the  assignment  to  each  layer  is  routable  using 
t  tracks. 

Proof:  If  procedure  Greedy  is  able  to  assign  each  of  the  m  nets  to  a  layer,  then  the 
layer  assignments  satisfy  the  conditions  of  Theorem  1  and  so  are  routable  using  t 
tracks.  Suppose  the  algorithm  fails  while  trying  to  assign  net  {Ar,Br)  to  a  layer.  At 
this  time  nets  {Ai,Bi),  1  <  i  <  r,  have  been  assigned  to  layers  so  as  to  satisfy  the 
conditions  of  Theorem  1  and  the  assignment  of  net  {Ar,Br)  to  each  of  these  layers 
violates  these  conditions.  Consider  first  those  layers,  L^,  on  which  condition  (a)  is 
violated.  For  a  layer  s  G  La,  suppose  that  the  assigned  nets  are  (y4^_j,5j_J, 
Let  {A'j,B'^)  =  {Ar,Br).  Since  s  e  La,  we  have  «;  -  b'^_,  <  t. 
Now,  if  br-u  >  b'j_t,  then  Or  —  K-u  <  t  which  violates  condition  (a)  of  this  theorem. 
So,  br-it  <  Since,  b^-u  <         <  ...  <  b'^_^  <  t  of  the  It  -  1  nets 
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{Ar-tt+i,Br-iw),  •  •  • ,  {Ar-uBr-i)  havc  bccn  assigned  to  layer  5  €  La.  Consequently, 
the  layers  in  La  account  for  t\La\  ol  these  It  -  1  nets. 

In  a  similar  way,  we  can  show  that  the  remaining  /  -  \La\  layers  account  for 
another  t{l-\La\)  of  these  nets.  This  gives  us  a  total  of  tl  nets,  whereas  we  had  only 
tl-l.  This  contradiction  implies  that  procedure  Greedy  cannot  fail  unless  conditions 
(a)  and  (b)  are  not  satisfied.  '-' 

Procedure  RoundRobin  is  easily  seen  to  have  complexity  of  0(m).  A  straightfor- 
ward implementation  of  procedure  Greedy  will  have  complexity  of  0(m/).  However, 
by  using  priority  search  trees  [18]  the  complexity  can  be  reduced  to  O(mlog/).  In 
practice,  since  /  is  quite  small,  it  is  unlikely  that  the  priority  search  tree  implemen- 
tation will  run  faster  than  the  straightforward  implementation  in  which  the  /  layers 
are  checked  in  sequence.  The  actual  routing  for  all  /  layers  can  be  done  in  0{mt) 
time  using  the  computed  layer  assignment  and  the  single  layer  routing  algorithm  of 
Leiserson  and  Pinter  [15]. 

Using  Theorem  2,  we  can  develop  a  linear  time  algorithm  to  determine  the 
minimum  number  of  tracks  needed  to  route  an  instance  in  /  layers  as  well  as  to  find 
the  minimum  number  of  layers  needed  for  a  t  track  routing.  The  algorithm  for  the 
former  is  given  in  Figure  2.4.  This  figure  also  shows  the  changes  needed  in  case  t  is 
given  and  we  wish  to  determine  the  minimum  number  of  layers.  The  correctness  of 
the  algorithm  follows  from  that  of  Theorem  2  and  the  fact  that  t  (or  /)  is  increased 
only  if  the  current  t  (/)  is  found  to  be  infeasible.  The  complexity  is  0(m)  as  neither 
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procedure  MinimizeTracks  ;  {or  MinimizeLayers} 

{  Determine  the  minimum  number  of  tracks  per  layer  (or  minimum  number 

of  layers)  needed  for  multilayer  river  routing  } 
begin 

t:=0;  {or  /  :=  1} 

I  :=  1  ; 

while  (t  <  m  —  It)  do 

if  (a,+/t  -  bi  <  t)  or  (fe,+/t  -  a,  <  t) 
then  t:=t  +  I  {or  /  :=  /  +  1} 
else  i  :=  i  +  1  ; 

end  ; 

Figure  2.4.  Minimizing  the  number  of  tracks  or  layers 
i  nor  t  (/)  can  exceed  m.  So,  neither  clause  of  the  if  statement  can  be  executed  more 
than  m  —  1  times. 

Using  the  multilayer  river  routing  results  of  Baratz  [3],  one  can  trivially  extend 
all  the  results  of  Lim,  Cheng  and  Sahni  [17]  to  the  case  of  multilayer  joining  of 
compacted  cells.  So,  the  multilayer  minimum  area  join  of  two  compacted  cells  with 
m  nets  can  be  obtained  in  O(m^)  time.  If  we  wish  to  minimize  the  maximum  wire 
length  while  keeping  area  minimum,  the  asymptotic  time  complexity  is  still  O(m'). 
The  total  wire  length  can  be  minimized  while  keeping  area  minimum  in  0(m^  logm) 
time. 

In  HV  style  routing,  each  routing  layer  is  assigned  a  routing  direction  (either 
H  or  V).  In  an  H  (V)  layer  only  horizontal  (vertical)  wire  segments  can  be  laid  out. 
Horizontal  segments  on  one  layer  connect  to  vertical  segments,  of  the  same  net,  on 
another  layer  by  means  of  vias.  In  the  case  of  river  routing  instances,  one  can  see 
that  there  is  no  advantage  to  having  more  than  two  V-layers  (i.e.,  two  V-layers  are 
sufficient  to  route  all  river  routing  instances). 
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Let  RR(/,0  be  the  set  of  all  river  routing  instances  that  can  be  routed  in  / 
layers,  using  t  tracks  per  layer  and  using  river  routing  in  each  layer.  Let  HV(/,0  be 
all  river  routing  instances  that  can  be  routed  using  HV  style  routing,  /  layers,  and 
t  tracks  per  layer.  Note  that  HV(/,<)  includes  instances  routable  with  0,  1,  and  2 
V-layers.  Let  HVV(/,0  be  all  river  routing  instances  using  HV  style  routing,  /  -  2 
H-layers,  and  2  V-layers.  Theorems  5  and  6  below  hold  for  both  the  knock-knee  [25] 
and  directional  HV  models.  Theorem  7  holds  only  for  the  directional  model. 

Theorem  5  HV(/,<)  C  RR(/,0  for  every  I  >  I  and  every  t>\. 

Proof:  HV(/,  t)  C  RR(/,  t)  follows  from  a  more  general  result  obtained  by  Baratz  [3]. 
Baratz  [3]  has  shown  that,  for  river  routing  instances,  there  is  no  advantage  to  using 
any  routing  scheme  that  wires  a  net  on  more  than  one  layer.  Since  it  is  easy  to 
construct  river  routing  instances  X  such  that  X  G  RR(/,0  and  X  i  HV(/,<),  it 

follows  that  HV(/,0  C  RR(/,0- 

We  provide  a  simpler  proof  of  HV(/,<)  C  RR{l,t).  This  proof  will  also  establish 
our  next  result.  We  shall  show  that  if  X  is  a  river  routing  instance  such  that  X  i 
RR(/,0,  then  X  i  HV(/,0-  Hence,  HV(/,0  C  RR(/,0. 

Suppose  that  X  i  RR(/,0-  From  Theorem  2,  it  follows  that  a.+/t  -  b,  <  i  or 
h^it  -ai  <  t  for  some  i.   Suppose  that  a.+/,  -  6.  <  ^  (the  proof  is  similar  when 

_  ai  <  t).  So,  a,+it  <  b,  +  t.  Since  X  is  a  river  routing  instance,  at  least  nets 
i  +  t,...,i  +  It  intersect  a  vertical  cut  line  drawn  at  Oi+u.  Hence,  the  density  of  X 
at  Oi+it  \s>i  +  lt-{i  +  t)  +  l  =  {l-  \)t  +  1.  When  HV  style  routing  is  used  with 
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/,/>!,  layers,  at  most  /  -  1  layers  are  available  for  horizontal  routes.  With  t  tracks 
per  layer,  densities  of  at  most  (/  -  l)t  can  be  accommodated.  So,  X  ^  HV(/,<).  □ 

Theorem  6   HVV(/,0  C  RR(/-  1,<)  for  every  I  >  2  and  every  t>l. 

Proof:  As  in  Theorem  5,  suppose  that  X  ^  RR(/-1,0-  Let  i  be  such  that  a,+(,_i),- 
b,  <  t.  The  net  density  at  a.+(,_i),  is  >  (/  -  2)f  +  1.  In  HVV  routing,  two  layers 
are  V-layers.  So  only  /  -  2  layers  are  available  for  horizontal  segments.  This  is  not 
enough  as  the  horizontal  segment  density  is  >  (/  -  2)t  +  1  at  a.+(/-i)t.  Hence,  X  i 
HVV(/,<)- 

One  may  easily  construct  river  routing  instances  that  are  in  RR(/  -  1,  <)  but  not 
inHVV(/,0-  ° 
Theorem  7  RR(2,0     HV(/,0  -  HVV(/,0  for  every  I  >  I  and  every  t>l. 

Proof:  Consider  the  RR  instance  (ai,fei)  =  (1,2)  and  (02,^)  =  (2,3).  This  is  in 
RR(2,0  for  every  <  >  1  but  is  not  in  HV(/,<)  -  HVV(/,0  for  any  /.  □ 

As  remarked  earlier.  Theorem  7  holds  only  for  the  directional  model.  For  the 
knock-knee  model,  one  can  show  that  RK{l,t)  C  HV(/  +  1,0  for  every  /  >  1  and 
every  t  >  I. 


2.3    Constraint  Graph  Representation 

Lim  [16]  has  proposed  the  use  of  a  constraint  graph  to  determine  the  terminal 
positions  in  a  row  of  compacted  cells.  This  is  for  the  case  when  the  number  of  tracks 
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in  each  routing  channel  is  given  and  we  wish  to  minimize  the  layout  height.  In  the 
constraint  graph,  each  cell  is  represented  by  a  directed  chain  of  vertices.  Each  cell 
terminal  is  represented  by  a  vertex.  The  exception  is  when  a  compacted  cell  has 
terminals  at  the  same  j/  position  on  both  sides  of  the  cell.  In  this  case,  the  two 
terminals  at  the  same  y-position  are  represented  by  a  single  vertex.  The  vertex  chain 
is  linked  in  the  direction  of  increasing  y-position.  The  chain  edges  are  labeled  by  the 
minimum  allowable  terminal  separation.  In  addition,  the  constraint  graph  contains 
a  source  vertex  that  represents  the  bottom  of  the  layout  and  a  sink  vertex  that 
represents  the  layout  top.  The  source  vertex  connects  to  the  bottom  of  each  chain 
and  the  top  of  each  chain  is  connected  to  the  sink  vertex. 

Figure  2.5(b)  shows  the  chains  (solid  edges)  for  the  four  cell  row  of  Figure  2.5(a). 
To  complete  the  constraint  graph,  directed  edges  are  added  to  introduce  the  channel 
routing  constraints  of  Theorem  2.  These  are  represented  by  the  broken  edges  of 
Figure  2.5(b).  Figure  2.5(b)  is  for  the  two  layer  case. 

Lim  [16]  has  shown  that  the  constraint  graph  is  acyclic  provided  the  number 
of  tracks  in  each  routing  channel  is  >  0.  He  has  proposed  handling  channels  with 
zero  tracks  by  finding  first  the  minimum  area  joining  of  the  adjacent  cells  (only  cell 
stretching  is  permitted  now)  and  then  combining  these  two  cells  into  one.  I.e.,  the  two 
cells  are  replaced  by  their  minimum  area  join.  This  strategy  can  be  shown  to  result 
in  non-optimality  of  the  algorithm  proposed  in  Lim  [16].  To  preserve  optimality,  it 
is  necessary  to  merge  the  vertices  that  represent  terminals  that  are  the  endpoints  of 
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sink 


(b)  Constraint  graph  representation 
Figure  2.5.  Constraint  graph  representation 

nets  that  are  to  be  routed  using  no  tracks  as  in  Figure  2.6.  The  resultant  constraint 
graph  is  also  acyclic. 

It  is  easy  to  see  that  the  number  of  vertices  and  edges  in  the  constraint  graph 
is  0(n)  where  n  is  the  total  number  of  terminals.  Furthermore,  the  graph  can  be 
constructed  in  0(n)  time  given  the  number  of  routing  layers  and  the  number  of  tracks 
in  each  channel.  The  constraint  graph  described  by  us  is  identical  to  that  of  Lim  [16] 
except  in  the  way  channels  with  zero  tracks  are  handled  and  in  that  our  graph  is 
defined  for  /  >  1  routing  layers  while  that  of  Lim  [16]  is  only  for  /  =  1. 
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sink 


source 

Figure  2.6.  Merge  in  constraint  graph 


The  length  of  the  longest  path  from  the  source  vertex  of  the  constraint  graph 
to  each  of  the  remaining  vertices  can  be  computed  in  0(n)  time  by  doing  this  in 
topological  order  [14,  Section  6.5].  It  is  easy  to  see  that  if  each  terminal  is  placed 
at  a  vertical  position  given  by  the  longest  path  length  from  the  source,  then  all  nets 
can  be  routed  in  the  given  number  of  tracks  (as  the  conditions  of  Theorem  2  are 
satisfied  in  each  routing  channel).  Furthermore,  Lim  [16]  has  shown  that  such  a 
positioning  of  terminals  results  in  a  stretched  layout  of  minimum  height  for  the  given 
channel  widths.  As  a  result,  when  channel  widths  are  known,  cells  can  be  stretched 
to  minimize  area  in  0(n)  time.  The  channel  widths  that  result  in  minimum  area  can 
be  determined  in  0{n{n/cy-^)  time  where  c  is  the  number  of  cells  by  trying  out  all 
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procedure  Heuristicl  ; 
begin 

for  I  :=  1  to  c  -  1  do 

begin 

determine  the  minimum  area  join  of  each  pair  of  adjacent  cells  ; 
select  the  pair  that  has  minimum  area  and  replace  it  with  its 
minimum  area  join  ; 
end  ; 
end  ; 

Figure  2.7.  Heuristic  1 
possible  channel  widths  [16].  Since  this  is  feasible  only  for  small  c,  we  propose  several 
heuristics  in  the  next  section. 

2.4    Heuristics  to  Minimize  Area 
We  formulate  three  greedy  heuristics  to  obtain  the  minimum  area  join  of  a  row 
of  c  compacted  cells  that  have  a  total  of  n  terminals. 

2.4.1    Heuristic  1 

The  heuristic  is  described  in  Figure  2.7. 

In  each  iteration  of  the  for  loop  we  examine  every  pair  of  adjacent  cells.  For 
each  pair,  the  minimum  area  join  is  found  using  the  algorithm  of  Lim,  Cheng  and 
Sahni  [17]  extended  to  the  multilayer  case  as  discussed  in  Section  2.2.  The  pair  which 
has  the  minimum  area  join  is  replaced  by  a  single  cell  that  represents  this  join.  So 
following  each  iteration  of  the  for  loop  the  number  of  cells  decreases  by  one.  When 
the  for  loop  terminates,  we  are  left  with  a  single  cell  that  represents  the  join  of 
all  c  cells.  The  time  needed  to  determine  the  minimum  area  join  of  a  pair  of  cells 
with  n,  nets  between  them  is  0{nf).  The  time  to  do  this  for  all  pairs  of  adjacent 
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cells  is  0{Z,n])  =  0{n^).  So,  the  for  loop  iteration  with  i  =  1  takes  0{n'^)  time. 
On  subsequent  iterations,  only  the  two  pairs  that  include  the  cell  introduced  in  the 
previous  iteration  need  to  have  their  minimum  area  join  computed.  Since  each  cell 
pair  being  considered  includes  at  least  one  composite  cell,  the  minimum  area  join 
is  computed  by  considering  the  portion  of  the  constraint  graph  that  represents  all 
the  basic  cells  in  the  cell  pair.  Channel  widths  for  channels  within  a  composite  cell 
are  not  changed  while  obtaining  the  minimum  area  join  of  the  cell  pair.  However, 
as  different  channel  widths  for  the  channel  between  the  two  (composite)  cells  being 
joined  are  tried,  the  constraint  graph  is  used  to  determine  the  minimum  height  of  the 
combined  cell.  So,  the  time  to  combine  two  (composite)  cells  with  n,  terminals  in  the 
channel  between  them  is  O(nni).  Hence  the  time  for  the  remaining  c  -  2  iterations 
is  0(nE,  ".)  =  O(n^).  The  overall  complexity  of  Heuristic  1  is  therefore  0{n'^).  In 
case  the  terminals  are  uniformly  distributed  over  the  cells,  n,  =  0(n/c)  for  all  t.  The 
time  for  the  first  iteration  of  the  for  loop  is  now  O(nVc)  and  that  for  each  of  the 
remaining  iterations  is  0(n^/c).  The  overall  time  is  O(n^). 

2.4.2    Heuristic  2 

In  this  heuristic,  we  begin  by  assigning  each  channel  the  number  of  tracks  needed 
to  route  the  channel  with  no  cell  stretching.  This  number  can  be  determined  in  0(n,) 
time  for  a  channel  with  n.  nets  as  described  in  Section  2.2.  The  time  taken  to  do 
this  for  all  c  -  1  channels  is  0(n).  The  configuration  obtained  in  this  way  is  the 
maximum  width  layout.  Starting  from  this  configuration,  we  reduce  the  total  number 
of  tracks  available  across  all  c  -  1  channels  by  one  on  each  iteration.  For  this,  the 
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procedure  Heuristic2  ; 
begin 

for  each  channel  determine  the  number  of  tracks, 
needed  to  route  with  no  stretching,  1  <  i  <  c  ; 

set  up  the  constraint  graph  using  ti  tracks  in  channel  i,l  <t  <  c  ; 

compute  layout  area,  A  ; 

for  tracks  :=  t  downto  1  do  {reduce  by  1} 

begin 

for  I  :=  1  to  c  —  1  do 

begin 

reduce  the  number  of  tracks  in  channel  i  by  1  ; 
modify  the  constraint  graph  to  reflect  this  ; 
determine  the  length  of  the  longest  path  in  the  graph 
and  from  this  the  layout  area,  a,  ; 
end  ; 

select  j  such  that  Cj  =  min{  Oi  }  ; 
reduce  the  number  of  tracks  in  channel  j  by  1  ; 
A  =  min{  A,  aj  }  ; 
end  ; 
end  ; 

Figure  2.8.  Heuristic  2 
effect  of  a  one  track  reduction  is  computed  for  each  channel.  The  minimum  layout 
height  is  determined  by  computing  the  length  of  the  longest  path  in  the  constraint 
graph  of  Section  2.3.  The  track  reduction  is  done  in  the  channel  that  results  in  the 
smallest  layout  height  (hence  the  minimum  area  for  the  given  number  of  tracks).  The 
algorithm  is  stated  more  formally  in  Figure  2.8. 

When  the  algorithm  terminates,  A  is  the  area  of  the  minimum  area  join  found 
by  the  heuristic.  To  reconstruct  the  layout,  it  is  necessary  to  store  the  tracks  per 
channel  each  time  A  is  updated  in  the  statement  A  =  min{  A^a^  }. 

For  the  time  complexity,  we  see  that  the  steps  that  precede  the  outer  for  loop 
take  0(n)  time.  Each  iteration  of  the  outer  loop  takes  0(nc)  time.  Hence  this  loop 
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contributes  a  total  of  0{nct)  to  the  time.  Since  t  =  0(n),  the  overall  time  complexity 
of  Heuristic  2  is  0{n'^c). 
2.4.3    Heuristic  3 

Unlike  Heuristic  2  which  attempts  to  minimize  the  layout  height  for  each  value 
of  t,  the  total  number  of  tracks,  Heuristic  3  attempts  to  minimize  the  width  (i.e., 
total  number  of  tracks)  for  each  choice  of  layout  height.  The  heuristic  begins  with  a 
layout  height,  hi,  equal  to  the  height  of  the  tallest  compacted  cell.  At  each  iteration, 
the  next  layout  height  to  use  is  computed  as  described  later.  During  each  iteration, 
cells  are  combined  in  groups  of  at  most  k  {k  >  I  \s  a.  parameter  to  the  heuristic). 
Each  group  of  combined  cells  is  replaced  by  its  minimum  area  join  subject  to  the 
constraint  that  the  height  of  the  join  does  not  exceed  ht.  This  joining  of  <  fc  cells  at 
a  time  continues  until  only  one  cell  remains.  Its  area  is  computed  and  recorded.  The 
minimum  area  obtained  over  all  heights  tried  is  then  reported  as  the  best.  Heuristic 
3  is  given  in  Figure  2.9. 

In  our  implementation  of  Heuristic  3,  the  minimum  area  join  of  k  cells  is  found 
by  considering  the  portion  of  the  constraint  graph  for  all  the  basic  cells  included  in 
these  k  cells.  So,  for  this  purpose  composite  cells  are  not  handled  as  single  cells. 
Rather,  as  in  Heuristic  1,  the  basic  cells  they  are  composed  of  are  considered  and 
channel  widths  previously  assigned  to  the  associated  channels  are  not  changed.  Track 
assignment  is  done  only  for  the  -  1  channels  between  the  k  composite  cells.  We 
found  this  to  give  better  results  than  when  composite  cells  were  regarded  as  atomic. 
For  the  case  k  =  2,  the  minimum  area  is  determined  by  a  binary  search  over  the 
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procedure  HeuristicS  ; 
begin 

ht  :-  height  of  the  tallest  cell  ; 

repeat  {  minimize  width  subject  to  height  <  ht  } 
repeat  {  do  this  by  combining  k  cells  at  a  time  } 

select  k  adjacent  cells  such  that  the  minimum  height  cell  is  selected 
and  the  height  of  the  tallest  selected  cell  is  minimum 
(if  there  are  fewer  than  k  cells,  then  select  all  of  them)  ; 
obtain  the  minimum  area  layout  for  the  selected  cells  under 
the  constraint  that  the  layout  height  does  not  exceed  ht  ; 
during  the  preceding  step  record  the  next  value  of  ht 
that  is  possible  for  a  layout  ; 
until  one  cell  remains  ; 

compute  the  area  of  the  remaining  cell  and  record  it 
if  it  is  less  than  the  minimum  area  found  so  fax  ; 
if  there  is  no  next  height  then  terminate  ; 
ht  :=  next  height  ; 
until  false  ; 
end  ; 

Figure  2.9.  Heuristic  3 
number  of  tracks  in  the  single  channel.  This  takes  O(nlogn.)  time  where  n.  is  the 
number  of  nets  in  channel  i.  Thus  the  time  needed  for  the  inner  repeat  loop  when 
jfc  =  2  is  O(cnlogn)  (for  uniform  terminal  distribution  it  is  0(cn  log(n/c)).  During 
the  binary  search,  the  heights  corresponding  to  channel  widths  that  require  height 
>  ht  are  recorded.  The  minimum  of  these  heights  yields  the  next  value  of  ht. 

When  >  2,  all  track  combinations  for  the  -  1  channels  are  tried  as  in 
Section  2.3.  Again,  each  composite  cell  is  broken  up  into  its  basic  cells.  As  different 
track  combinations  are  tried,  we  record  the  minimum  height  >  ht  that  results  from 
any  track  combination.  This  gives  the  next  value  of  ht.  The  time  for  the  inner 
repeat  loop  is  0{{c/{k  -  l))n{n/k)''-')  (or  0((c/(i  -  l))n(n/c)*-^)  when  terminals 
are  uniformly  distributed). 
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In  all  our  experiments,  the  outer  repeat  loop  was  iterated  fewer  than  {k  -  l)n 
times.  To  ensure  that  the  number  of  iterations  \sO{kn),  one  may  adopt  the  following 
scheme.  When  the  number  of  iterations  first  reaches  {k  -  compute  a  set  of  at 
most  n  new  heights  by  beginning  with  the  current  constraint  graph.  This  uses  the 
current  assignment  of  number  of  tracks  in  each  channel.  Heuristic  2  is  next  used  to 
reduce  the  total  number  of  available  tracks  by  one  and  determine  the  height  needed 
to  complete  the  routing  with  the  reduced  number  of  tracks.  This  process  gives  us 
at  most  n  new  heights  hi  <  <  . . .  <  hp.  Heuristic  3  is  now  resumed  with  hi  as 
the  next  height.  Only  two  iterations  are  performed.  Then  Heuristic  3  is  resumed 
with  max{  h2,ht}  as  the  next  height.  Again  two  iterations  of  the  outer  repeat  loop 
are  done.  Next  the  heuristic  is  resumed  with  max{  hz.ht  }  as  the  next  height.  This 
continues  until  we  have  gone  through  p  resumptions  of  the  heuristic.  With  this  scheme 
to  limit  the  number  of  iterations,  the  complexity  of  Heuristic  3  becomes  0(cn^  log  n) 
when  )t  =  2  and  0((c/(ik  -  \))kn\nlkf-')  =  Oicn^in/k)'-')  when  k  >  2.  For  the 
case  when  the  n  terminals  are  uniformly  distributed  over  the  c  cells,  the  complexity 
is  0(cnMog(n/c))  when  A;  =  2  and  0{{c/{k  -  l))kn{n/c)''-')  =  0(cn(n/c)'=-i)  ^^^^ 
k  >  2.  One  may  verify  that  since  Heuristic  3  tries  the  maximum  useful  height  (i.e., 
the  height  needed  when  no  routing  tracks  are  available),  it  generates  optimal  solutions 
when  k  =  c. 


2.5    Experimental  Results 

We  programmed  our  three  heuristics  as  well  as  the  heuristic  Fang  [8]  in  C  and 
ran  tests  on  a  single  KSR  processor.  Optimal  solutions  for  instances  with  up  to  nine 
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cells  were  obtained  using  the  corrected  version  of  the  exhaustive  search  algorithnn  of 
Lim  [16].  Our  test  set  consisted  of  instances  that  had  a  number  of  cells,  c,  equal  to 
one  of  the  numbers  in  the  set  {3,  . . .,  9,  10,  20,  50,  100}.  For  each  value  of  c,  there 
were  twenty  instances  and  the  results  were  averaged  over  these  instances.  An  instance 
with  c  cells  had  c  -  1  routing  channels.  The  number,  t,  of  terminals  on  either  side  of 
each  routing  channel  was  equal  to  c  for  3  <  c  <  9  and  was  10  for  the  other  values  of 
c.  In  addition,  when  c  =  100,  we  also  had  instances  with  20  terminals  on  either  side. 
In  our  experiments,  we  considered  only  single  layer  and  two  layer  routing. 

Table  2.1  gives  the  average  percentage  by  which  the  area  of  the  single  layer 
solutions  generated  by  each  of  the  heuristics  exceeded  the  area  of  the  single  layer 
optimal  solution.  As  is  evident,  each  of  the  heuristics  proposed  in  this  chapter  gave 
noticeably  better  solutions  than  did  Fang.  This  table  is  only  for  the  cases  3  <  c  <  9 
as  for  c  >  9  the  optimal  algorithm  of  Lim  [16]  required  too  much  time  to  complete. 
When  k  >  c,  Heuristic  3  is  guaranteed  to  generate  an  optimal  solution.  So,  we  did 
not  run  these  cases. 

In  table  2.2,  we  have  used  the  single  layer  solution  produced  by  Fang  as  the 
benchmark  against  which  the  solutions  obtained  by  our  three  heuristics  are  compared. 
This  table  gives  the  average  percentage  by  which  the  area  of  the  solutions  produced 
by  our  heuristics  is  less  than  that  of  the  solutions  produced  by  Fang.  Our  solutions 
have  area  9  to  18%  less. 

Table  2.3  compares  the  computing  time  requirements  of  the  various  algorithms 
for  the  case  of  one  layer.  The  optimal  algorithm  is  useful  only  for  small  values  of  c 


Table  2.1.  Error  rate  (%)  over  optimal,  /  =  1 


cells 

t 

Fang 

Heuristic 
1 

Heuristic 
2 

Heuristic3 

k  =  2 

k  =  3 

k  =  i 

3 

3 

5.9 

0.5 

0.2 

0 

* 

* 

4 

4 

10.0 

0.9 

0.1 

0 

0 

* 

5 

5 

11.0 

3.7 

0.3 

0.3 

0.1 

0.1 

6 

6 

12.7 

2.4 

0.3 

0.2 

0.2 

0.0 

7 

7 

16.1 

3.5 

0.3 

0.1 

0.1 

0.1 

8 

8 

17.9 

2.7 

0.4 

0.3 

0.3 

0.1 

9 

9 

18.8 

3.5 

0.3 

0.4 

0.3 

0.1 

t  =  number  of  terminals  on  each  side  of  each  routing  channel 

♦  :   k  >  c 


Table  2.2.  Improvement  (%)  over  Fang,  /  =  1 


cells 

t 

Heuristic 
1 

Heuristic 
2 

Heuristic3 

k  =  2 

k  =  3 

it  =  4 

10 

10 

9.4 

14.0 

14.0 

14.2 

14.3 

20 

10 

10.5 

15.4 

15.4 

15.6 

15.6 

50 

10 

10.6 

16.1 

16.1 

16.2 

100 

10 

9.1 

16.5 

16.5 

16.6 

100 

20 

9.3 

18.3 

18.0 

-  :   excessive  run  time 
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Table  2.3.  Time  taken,  /  =  1 


cells 

t 

Fang 

Heuristic 
1 

Heuristic 
2 

Heuristic 

3 

Optimal 

k  =  2 

ifc  =  3j 

it  =  4 

3 

3 

0.0 

0.01 

0.02 

0  02 

* 

* 

0.01 

4 

4 

0.0 

0.02 

0.02 

0.04 

0.09 

* 

0.05 

5 

5 

0.0 

0.03 

0.04 

0.09 

0.36 

1.2 

0.77 

6 

6 

0.0 

0.03 

0.08 

0.21 

0.66 

3.7 

13 

7 

7 

0.0 

0.04 

0.14 

0.50 

3.4 

27 

278 

8 

8 

0.0 

0.07 

0.24 

0.84 

4.4 

34 

1.8t 

9 

9 

0.0 

0.09 

0.42 

1.8 

16 

72.3 

47' 

10 

10 

0.0 

0.14 

0.78 

3.1 

19 

334 

20 

10 

0.01 

0.32 

5.8 

16 

117 

1196 

50 

10 

0.01 

1.4 

93 

127 

896 

100 

10 

0.03 

4.4. 

782 

590 

4873 

100 

20 

0.06 

11 

3022 

4087 

Times  are  in  seconds, 
f  :  Times  are  in  hours. 


(say  up  to  7).  While  Fang  is  significantly  faster  than  the  heuristics  proposed  here, 
the  quality  of  the  solutions  generated  by  our  heuristics  is  superior. 

Table  2.4  is  the  analog  of  table  2.1  for  the  case  of  two  layers.  Again,  our  heuristics 
performed  considerably  better  than  did  Fang.  Table  2.5  gives  the  improvement  in 
area  due  to  increasing  the  number  of  routing  layers  from  one  to  two.  This  is  influenced 
somewhat  by  the  width  of  cells  which  in  our  case  ranged  from  5  to  30  times  the  track 
separation.  With  narrower  cells,  the  impact  of  the  second  layer  would  have  been 
greater  and  with  wider  cells,  it  would  have  been  less.  Also,  the  impact  of  the  second 
layer  is  more  when  more  routing  tracks  are  needed.  For  the  smaller  instances  of 
table  2.4,  for  example,  the  optimtil  solutions  with  /  =  2  required,  on  average,  only 
1.8%  less  area  than  when  /  =  1. 


Table  2.4.  Error  rate  (%)  over  optimal,  /  -  2 


cells 

t 

Fang 

Heuristic 
1 

Heuristic 
2 

Heuristic3 

jb  =  2 

=  3 

k  =  4 

3 

3 

6.3 

0.5 

0 

0 

* 

* 

4 

4 

9.8 

1.1 

0.1 

0 

0 

* 

5 

5 

11.0 

2.7 

0.1 

0.2 

0.1 

0.0 

6 

6 

13.0 

1.4 

0.1 

0.0 

0.0 

0.1 

7 

7 

15.9 

2.3 

0.1 

0 

0 

0 

8 

8 

16.8 

1.7 

0.2 

0 

0.1 

0.0 

9 

9 

18.4 

2.2 

0.1 

0.1 

0.1 

0.0 

Table  2.5.  Improvement  (%)  over  /  =  1  cases 


cells 

t 

Fang 

Heuristic 
1 

Heuristic 
2 

Heuristic3 

k  =  2 

k  =  3 

A:  =  4 

10 

10 

4.1 

4.7 

3.3 

3.3 

3.0 

3.0 

20 

10 

4.1 

5.8 

3.4 

3.4 

3.3 

3.2 

50 

10 

4.6 

5.2 

3.4 

3.2 

3.2 

100 

10 

4.7 

6.3 

3.5 

3.4 

3.4 

100 

20 

7.1 

10.4 

4.8 

5.2 

30 


Table  2.6.  Improvement  (%)  over  Fang,  /  =  2 


cells 

t 

Heuristic 
1 

Heuristic 
Z 

Heuristics 

k  =  2 

k  =  3 

it  =  4 

10 

10 

10.0 

13.3 

13.2 

13.2 

13.3 

20 

10 

12.2 

14.8 

14.8 

14.9 

14.8 

50 

10 

11.2 

15.1 

14.9 

15.0 

100 

10 

10.7 

15.4 

15.4 

15.4 

100 

20 

12.5 

16.3 

16.3 

simi 


Table  2.6  is  the  analog  of  table  2.2  for  the  case  of  two  layers.  The  results  are 
lar  to  those  in  table  2.2.  Table  2.7  gives  the  average  computing  times  for  the  two 
layer  instances.  These  are  less  than  for  the  one  layer  case  as  the  constraint  graph  has 
fewer  edges. 

For  large  c,  we  recommend  the  use  of  heuristic  2  or  3  (with  k  =  2)  and  for  small 
c  we  recommend  using  heuristic  3  (with    =  3  or  4). 


2.6  Conclusion 

We  have  considered  the  problem  of  joining  a  row  of  compacted  cells  and  de- 
loped  heuristics  to  stretch  cells  and  river-route  the  nets  so  that  the  layout  area  is 
ized.  Our  proposed  heuristic  was  compared,  experimentally,  with  Fang  [8]  and 
found  to  produce  layouts  with  less  area.  However,  Fang  is  faster.  We  recommend  the 
use  of  our  Heuristic  3  with  /:  =  3  or  4  in  practice. 


ve 


mmimi2 
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Table  2.7.  Time  taken,  /  =  2 


cells 

t 

Fang 

Heuristic 
1 

Heuristic 
2 

Heuristic3 

Optimal 

it  =  2 

jt  =  3 

it  =  4 

3 

3 

0.0 

0.01 

0.02 

0.02 

* 

0.0 

4 

4 

0.0 

0.02 

0.02 

0.03 

0.06 

* 

0.04 

5 

5 

0.0 

0.02 

0.02 

0.07 

0.25 

0.75 

0.60 

6 

6 

0.0 

0.02 

0.04 

0.13 

0.38 

2.0 

11 

7 

7 

0.0 

0.04 

0.06 

0.29 

1.9 

14 

223 

8 

8 

0.0 

0.06 

0.12 

0.53 

2.5 

19 

1.5t 

9 

9 

0.0 

0.06 

0.20 

0.96 

7.5 

34 

39^ 

10 

10 

0.01 

0.09 

0.32 

1.7 

9.9 

153 

20 

10 

0.01 

0.24 

2.7 

9.9 

65 

651 

50 

10 

0.01 

1.1 

42 

85 

586 

100 

10 

0.02 

3.7 

357 

472 

3069 

100 

20 

0.05 

7.9 

1377 

3166 

Times  are  in  seconds. 
I  :  Times  are  in  hours. 


CHAPTER  3 

A  NEW  WEIGHT  BALANCED  BINARY  SEARCH  TREE 
3.1  Introduction 

A  dictionary  is  a  set  of  elements  on  which  the  operations  of  search,  insert,  and 
delete  are  performed.  Many  data  structures  have  been  proposed  for  the  efficient 
representation  of  a  dictionary  [14].  These  include  direct  addressing  schemes  such  as 
hash  tables  and  comparison  schemes  such  as  binary  search  trees,  AVL-trees,  red/black 
trees  [12],  trees  of  bounded  balance  [21],  treaps  [1],  deterministic  skip  lists  [20],  and 
skip  lists  [26].  Of  these  schemes,  AVL-trees,  red/black  trees,  and  trees  of  bounded 
balance  (WB(a))  are  balanced  binary  search  trees.  When  representing  a  dictionary 
with  n  elements,  using  one  of  these  schemes,  the  corresponding  binary  search  tree 
has  height  O(logn)  and  individual  search,  insert,  and  delete  operations  take  O(logn) 
time.  When  (unbalanced)  binary  search  trees,  treaps,  or  skip  lists  are  used,  each 
operation  has  an  expected  complexity  of  O(log  n)  but  the  worst  case  complexity  is 
0(n).  When  hash  tables  are  used,  the  expected  complexity  is  0(1)  per  operation. 
However,  the  worst  case  complexity  is  0(n).  So,  in  applications  where  a  worst  case 
complexity  guarantee  is  critical,  one  of  the  balanced  binary  search  tree  schemes  is  to 
be  performed. 
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In  this  chapter,  we  develop  a  new  balanced  binary  search  tree  called  ^-BBST  {/3- 
balanced  binary  search  tree).  Like  WB(a)  trees,  this  achieves  balancing  by  controlling 
the  relative  number  of  nodes  in  each  subtree.  However,  unlike  WB(q;)  trees,  during 
insert  and  delete  operations,  rotations  are  performed  along  the  search  path  whenever 
they  reduce  the  internal  path  length  of  the  tree  (rather  than  only  when  a  subtree  is 
out  of  balance).  As  a  result,  the  constructed  trees  are  expected  to  have  a  smaller 
internal  path  length  than  the  corresponding  WB(a)  tree.  Since  the  average  search 
time  is  closely  related  to  the  internal  path  length,  the  time  need  to  search  in  a  /?-BBST 
is  expected  to  be  less  than  that  in  a  WB(a)  tree. 

In  Section  3.2,  we  define  the  total  search  cost  of  a  binary  search  tree  and  show 
that  the  rebalancing  rotations  performed  in  AVL  and  red/black  trees  might  increase 
this  metric.  We  also  show  that  while  similar  rotations  in  WB(a)  trees  do  not  increase 
this  metric,  insert  and  delete  operations  in  WB(a)  trees  do  not  avail  of  all  oppor- 
tunities to  reduce  the  metric.  In  Section  3.3,  we  define  /3-BBSTs  and  show  their 
relationship  to  WB(a)  trees.  Search,  insert,  and  delete  algorithms  for  ^-BBSTs  are 
developed  in  Section  3.4.  A  simplified  version  of  ^-BBSTs  is  developed  in  Section  3.5. 
Search,  insert  and  delete  operations  for  this  version  also  take  O(log  n)  time  each.  An 
even  simpler  version  of  ^-BBSTs  is  developed  in  Section  3.6.  For  this  version,  we 
show  that  the  average  cost  of  an  insert  and  search  operation  is  O(logn)  provided  no 
deletes  are  performed. 

An  experimental  evaluation  of  /3-BBSTs  and  competing  schemes  for  dictionaries 
(AVL,  red/black,  skip  lists,  etc.)  was  done  and  the  results  of  this  are  presented  in 
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Section  3.7.  This  section  also  compares  the  relative  performance  of  ^-BBSTs  and  the 
two  simplified  versions  of  Sections  3.5  and  3.6. 

3.2    Balanced  Trees  and  Rotations 

Following  an  insert  or  delete  operation  in  a  balanced  binary  search  tree  (e.g., 
AVL,  red/black,  WB(a),  etc.),  it  may  be  necessary  to  perform  rotations  to  restore 
balance.  The  rotations  are  classified  as  LL,  RR,  LR,  and  RL  [14].  LL  and  RR 
rotations  as  well  as  LR  and  RL  rotations  are  symmetric.  While  the  conditions  under 
which  the  rotations  cire  performed  vary  with  the  class  of  balanced  tree  considered, 
the  node  movement  patterns  are  the  same.  Figure  3.1  shows  the  transformation 
performed  by  an  LL  and  an  LR  rotation.  In  this  figure,  nodes  whose  subtrees  have 
changed  as  a  result  of  the  rotation  axe  designated  by  a  prime.  So,  p'  is  the  original 
node  p  however  its  subtrees  are  different. 

Let  h{x)  be  the  height  of  the  subtree  with  root  x.  Let  s{x)  be  the  number  of 
nodes  in  this  subtree.  When  searching  for  an  element  i,  a:  is  compared  with  one 
element  at  each  of  l{x)  levels,  where  /(x)  is  the  level  at  which  x  is  present  (the  root 
is  at  level  1).  So,  one  measure  of  the  "goodness"  of  the  binary  search  tree,  T,  for 
search  operations  (assuming  each  element  is  searched  for  with  equal  probability)  is 
its  totcil  search  cost  defined  as: 


C{T)  =  E  /(x). 
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b  c 

(b)  LR  rotation 
Figure  3.1.  LL  and  RL  rotations 

Notice  that  C{T)  =  I{T)  +  n  where  I{T)  is  the  internal  path  length  of  T  and 
n  is  the  number  of  elements/nodes  in  T.  The  cost  of  unsuccessful  searches  is  equal 
to  the  external  path  length  E{T).  Since  E{T)  =  I{T)  +  2n,  minimizing  C{T)  also 
minimizes  E{T). 

Total  search  cost  is  important  as  this  is  the  dominant  operation  in  a  dictionary 
(note  that  insert  can  be  modeled  as  an  unsuccessful  search  followed  by  the  insertion 
of  a  node  at  the  point  where  the  search  terminated  and  deletion  can  be  modeled  by 
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a  successful  search  followed  by  a  physical  deletion;  both  operations  are  then  followed 
by  a  rebalancing/restructuring  step). 

Observe  that  in  an  actual  implementation  of  the  search  operation  in  program- 
ming languages  such  as  C++,  C,  and  Pascal,  the  search  for  an  x  at  level  /(i)  will 
involve  upto  two  comparisons  at  levels  1,2, . . .  ,/(ar).  If  the  code  first  checks  i  =  e, 
where  is  the  element  at  level  i  to  be  compared  and  then  x  <  e,  to  decide  whether  to 
move  to  the  left  or  right  subtree,  then  the  number  of  element  comparisons  is  exactly 
2/(i)  -  1.  In  this  case,  the  total  number  of  element  comparisons  is 

NC{T)  =  2      K^)  -  "  =  2C(T)  -  n 

and  minimizing  C{T)  also  minimizes  NC{T).  If  the  code  first  checks  x  <  e.  and  then 
X  =  e,  (or  >  e,),  the  number  of  element  comparisons  done  to  find  x  is  /(i)  +  r(x)  +  1 
where  r(x)  is  the  number  of  right  branches  on  the  path  from  the  root  to  x.  The 
total  number  of  comparisons  is  bounded  by  2C{T).  For  simplicity,  we  use  C{T)  to 
motivate  our  data  structure. 

In  an  AVL  tree,  when  an  LL  rotation  is  performed,  h{q)  =  h{c)-\-l  =  h{d)+\  (see 
Figure  3.1(a)).  At  this  time,  the  balance  factor  at  gp  is  h{p)  -  h{d)  =  2.  The  rotation 
restores  height  balance  which  is  necessary  to  guarantee  O(logn)  search,  insert,  delete 
operations  in  an  n  node  AVL  tree.  The  rotation  may,  however,  increase  the  total 
search  cost.  To  see  this,  notice  that  an  LL  rotation  affects  the  level  numbers  of  only 
those  nodes  that  are  in  the  subtree  with  root  gp  prior  to  the  rotation.  We  see  that 
l[q')  =  l{q)  -  1,  /(p')  =  l{p)  -  l,l{gp')  =  l{gp)  +  1,  the  total  search  cost  of  the  subtree 


37 


with  root  a  is  decreased  by  s{a)  as  a  result  of  the  rotation,  etc.  Hence,  the  increase 
in  C{T)  due  to  the  rotation  is: 

Kp')  -  Kp)  +  W  -  Kg)  +  Kdp)  -  Kgp)  -  4^)  -  4^)  +  ^(^) 

=  -1  -  1  +  1  -  s{q)  +  1  +  s{d)  =  s{d)  -  s{q). 

A  similar  analysis  shows  that  an  LR  rotation  increases  C{T)  by  s{d)  -  s{q). 

If  the  LL  rotation  was  triggered  by  an  insertion,  s{q)  is  at  least  one  more  than  the 
minimum  number  of  nodes  in  an  AVL  tree  of  height  t  =  h{q)  -  1.  So,  s{q)  >  .^'+V \^ 
where  <^  =  (1  +  v/5)/2.  The  maximum  value  for  s{d)  is  2'  -  1.  So,  an  LL  rotation 
has  the  potential  of  increasing  total  search  cost  by  as  much  as 

2'  _  1  _  cl>'+yy/5  «  2'  -  1  -  1.62'+V2.24. 

This  is  negative  for  <  <  2  and  positive  for  t  >  2.  When  t  =  10,  for  example,  an 
LL  rotation  may  increase  total  search  cost  by  as  much  as  877.  As  t  gets  larger,  the 
potential  increase  in  search  cost  gets  much  greater.  This  analysis  is  easily  extended 
to  the  remaining  rotations  eind  also  to  red/black  trees. 

Definition  (WB(a)  [21])  The  balance,  B{p),  of  a  node  p  in  a  binary  tree  is  the 
ratio  (s(/)  +  l)/{s{p)  +  1)  where  /  is  the  left  child  of  p.  For  q  G  [0, 1/2],  a  binary  tree 
T  is  in  WB(a)  iff  a  <  B{p)  <  1  -  a  for  every  node  p  in  T.  By  definition,  the  empty 
tree  is  in  WB(a)  for  all  a. 
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Lemma  1  (1)  The  maximum  height,  hmax{n),  of  an  n  node  tree  in  WB(a)  is  ~ 
log^(n  +  l)  [21] 

1  —a 

(2)  Inserts  and  deletes  can  be  performed  in  an  n  node  tree  in  WB(a)  in  O(logn)  time 
/or  2/11  <  a  <  1  -  v/2/2  [4]. 

(3)  Each  search  operation  in  an  n  node  tree  in  WB(a)  takes  0(\og  n)  time  [21]. 

In  the  case  of  weight  balanced  trees  WB(a),  an  LL  rotation  is  performed  when 
B{gp)  «  1  -  a  and  B{p)  >  a/{l  -  a)  (see  Figure  3.1(a))  [21].  So, 


s{pl+l  _       s{p)  +  l 
^     ''~5(yp)  +  l      s{p)  +  s{d)  +  2 


or 


and 


,  ,    Q        2a  -  1 

s(d)  «  s{p)-  +   

^  '  1  —  a      1  —  Q 


or 


1  -  a  -    '"^^     s{p)  +  1 


,  ^    a        2a  -  1 


So,  LL  rotations  (and  also  RR)  do  not  increase  the  search  cost.  For  LR  rotations 
[21],  Bigp)  «  1  -  a  and  B{p)  <  a/(l  -  a).  So,  s{d)  ^  s{p)^  +  ^  and  with 
respect  to  Figure  3.1(b), 


>  B{p) 


1  -  a        '"^^       s{p)  +  1 
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or 


^    1  -  2q  a 
s(q)  >  s{p)-  • 

For  Q  <  1/3,  s{q)  >  s{d)  and  LR  (RL)  rotations  do  not  increase  search  cost.  Thus, 
in  the  case  of  WB(a)  trees,  the  rebalancing  rotations  do  not  increase  search  cost. 
This  statement  remains  true  if  the  conditions  for  LL  and  LR  rotation  are  changed  to 
those  in  Blum  and  Mehlhorn  [4]. 

While  rotations  do  not  increase  the  search  cost  of  WB(a)  trees,  these  trees 
miss  performing  some  rotations  that  would  reduce  search  cost.  For  example,  it  is 
possible  to  have  a  <  B{gp)  <  1  -  «,  B{p)  >  ,  and  s{q)  >  s{d).  Since  B{gp)  isn't 
high  enough,  an  LL  rotation  isn't  performed.  Yet,  performing  such  a  rotation  would 
reduce  search  cost. 

3.3  jg-BBSTs 

Definition  A  cost  optimized  search  tree  (COST)  is  a  binary  search  tree  whose  search 
cost  cannot  be  reduced  by  performing  a  single  LL,  RR,  LR,  or  RL  rotation. 

Theorem  8  IfT  is  a  COST  with  n  nodes,  its  height  is  at  most  log^(v/5(n  +  1))  -  2. 

Proof  Let  Nk  be  the  minimum  number  of  nodes  in  a  COST  of  height  h.  Clearly, 
yVo  =  0  and  =  1.  Consider  a  COST  Q  of  height  h  >2  having  the  minimum 
number  of  nodes  Nh-  Q  has  one  subtree  R  whose  height  is  A  -  1  and  another,  5, 
whose  height  \s  <  h-1.  R  must  be  a  minimal  COST  of  height  /i  -  1  and  so  has 
Nh-1  nodes.  R,  in  return,  must  have  one  subtree,  f/,  of  height  /i  -  2  and  another,  V, 
of  height  <h-2.  Both  U  and  V  are  COSTs  as     is  a  COST.  Since  i?  is  a  minimal 
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COST,  is  a  minimal  COST  of  height  /i  -  2  and  so  has  Nh-2  nodes.  Since  Q  is  a 
COST,  |5|  >  max{\U\, \V\}.  We  may  assume  that  Nh  is  a  nondecreasing  function  of 
h.  So,  |5|  >  Nk-2-  Since  Q  is  &  minimal  COST  of  height  h,  \S\  =  Nh-2-  So, 

Nh  =  Nh-1  +  Nk-2  +  1,  h>2 

No  =  0,Ni  =  1. 

This  recurrence  is  the  same  as  that  for  the  minimum  number  of  nodes  in  an  AVL  tree 
of  height  h.  So,  Nh  =  - 1  where  F,-  is  the  t'th  Fibbonacci  number.  Consequently, 
Nh  «  <^''+VV5  -  1  and  /i  <  log^(y5(n  +  1))  -  2.  ° 


Corollary  1  The  maximum  height  of  a  COST  with  n  nodes  is  the  same  as  that  of  an 
AVL  tree  with  this  many  nodes. 

Definition  Let  a  and  b  be  the  root  of  two  binary  trees,  a  and  b  are  ^-balanced, 
0  <  0  <l,  with  respect  to  one  another,  denoted  ^-{a,b),  iff 

(a)  ^{s{a)  -  1)  <  s{b) 

(b)  ^{s{b)  -  1)  <  s{a) 

A  binary  tree  T  is  ^-balanced  iff  the  children  of  every  node  in  T  are  ^-balanced. 

A  full  binary  tree  is  1 -balanced  and  a  binary  tree  whose  height  equals  its  size 
(i.e.,  number  of  nodes)  is  0-balanced. 
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Lemma  2  If  the  binary  tree  T  is  ^-balanced,  then  it  is  -^-balanced  /or  0  <  7  <  /3. 
Proof    Follows  from  the  definition  of  balance. 


Lemma  3  If  the  binary  tree  T  is  ^-balanced,  0  <  /3  <  1/2,  then  it  is  in  WB(a)  for 

Proof    Consider  any  node  p  in  T.  Let  /  and  r  be  node  p's  left  and  right  children. 

sil)  +  1       _  1  


B{p) 


s{l)  +  s{r)  +  2      l  +  ig±l 


Since  T  is  yS-balanced,  s{l)  -  1  <  s{r)/0  or  sil)  +  1  <  s(r)//9  +  2.  So, 


sir)  +  1  -       ^  0isir)  +  1)  - 


or 

sir)  +  1 


>0. 


sil)  +  1 

So,  Bip)  <  1/(1  +  0).  Further,  s(r)  -  1  <  5(/)//3.  So, 

And,  Bip)  >  1/(1  +  l/;9)  =  13/il  +  Hence  /?/(!  +  /3)  <  5(p)  <  1/(1  +  0)  for 
every  p  in  T.  So,  T  is  in  WB(cr)  for  a  =  /?/(!  +  □ 
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o  o 


Figure  3.2.  A  tree  in  WB(l/4)  that  is  not  ^-balanced 


Remark  While  every  /^-balanced  tree,  0  <  ^  <  1/2,  is  in  WB(a)  for  a  =  ^/(l  + 
there  are  trees  in  WB(q)  that  are  not  /?-balanced.  Figure  3.2  shows  an  example  of  a 
tree  in  WB(l/4)  that  is  not  ^-balanced. 

Lemma  i  If  T  is  a  COST  then  T  is  ^-balanced. 

Proof  If  r  is  a  COST,  then  every  subtree  of  T  is  a  COST.  Consider  any  subtree 
with  root  p,  left  child  /,  and  right  child  r.  If  neither  /  nor  r  exist,  then  s{l)  =  s{r)  =  0 
and  p  is  |-balanced.  If  s{l)  —  0  and  s{r)  >  1,  then  r  has  a  nonempty  subtree  with 
root  t  and  s{t)  >  s{l).  So  p  is  not  a  COST.  Hence,  s{r)  <  1  and  p  is  ^-balanced.  The 
same  is  true  when  s{r)  —  0.  So,  assume  s{X)  >  0  and  s{r)  >  0. 

If  s{l)  =  1,  then  s{r)  <  3  as  otherwise,  one  of  the  subtrees  of  r  has  m  >  2 
nodes  and  m  >  s{l)  implies  p  is  not  a  COST.  Since  s{r)  <  3,  \is{r)  —  1)  <  s{l)  and 
^{s{l)  —  1)  <  •s(r).  So,  p  is  ^-balanced.  The  same  proof  applies  when  s{r)  =  1.  When 
s{l)  >  1  and  s{r)  >  1,  let  a  and  b  be  the  roots  of  the  left  and  right  subtrees  of  /.  Since 
p  is  a  COST,  s{a)  <  s{r)  and  s{b)  <  s{r).  So,  s{l)  =  s{a)  +  s{b)  +  1  <  2s{r)  +  1  and 
|(s(/)  —  1)  <  s{r).  Similarly,  |(5(r)  —  1)  <  s{l).  So,  |-(/,r).  Since  this  proof  applies 
to  every  nodes  in  T,  the  children  of  every  p  are  | -balanced  and  T  is  ^-balanced.  □ 
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.0. 


Figure  3.3.  |-balcinced  tree  that  is  not  a  COST 
Remark   There  are  ^-balanced  trees  that  are  not  COSTs  (see  Figure  3.3). 

While  a  COST  is  in  WB(l/3)  and  WB(a)  trees  can  be  maintained  efficiently 
only  for  2/11  <  a  <  1  -  l/\/2  «  0.293,  a  COST  is  better  balanced  than  WB(a) 
trees  with  a  in  the  usable  range.  Unfortunately,  we  are  unable  to  develop  O(log  n) 
insert /delete  algorithms  for  a  COST. 

In  the  next  section,  we  develop  insert  and  delete  algorithms  for  )3-balanced 
binary  search  trees  (/3-BBST)  for  0  <  ;9  <  - 1.  Note  that  every  (^2-  1)-BBST  is 
in  WB(a)  for  a  =  1  -  l/\/2  which  is  the  largest  permissible  or.  Since  our  insert  and 
delete  algorithms  perform  rotations  sAong  the  search  path  whenever  these  result  in 
improved  search  cost,  BBSTs  are  expected  to  have  better  search  performance  than 
WB(a)  trees  (for  a  =  /3/(l  +  0)). 

Each  node  of  a  ^-BBST  has  the  fields  LeftChild,  Size,  Data,  and  RightChild. 
Since  every  ,9-BBST,  ^  >  0,  is  in  WB(a),  for  a  >  0,  ^-BBSTs  have  height  that  is 
logarithmic  in  n,  the  number  of  nodes  (provided  ^  >  0). 
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3.4    Search.  Insert,  and  Delete  in  a  /3-BBST 
To  reduce  notational  clutter,  in  the  rest  of  the  chapter,  we  abbreviate  s{a)  by  a 
(i.e.,  the  node  name  denotes  subtree  size). 
.3.4.1  Search 

This  is  done  exactly  as  in  any  binary  search  tree.  Its  complexity  is  0{h)  where 
h  is  the  height  of  the  tree.  Notice  that  since  each  node  has  a  size  field,  it  is  easy 
to  perform  a  search  based  on  index  (i.e.,  find  the  lO'th  smallest  key).  Similarly,  our 
insert  and  delete  algorithms  can  be  adapted  to  indexed  insert  and  delete. 

3.4.2  Insertion 

To  insert  a  new  element  x  into  a  yS-BBST,  we  first  search  for  x  in  the  ^-BBST. 
This  search  is  unsuccessful  (as  x  is  not  in  the  tree)  and  terminates  by  falling  off  the 
tree.  A  new  node  y  containing  i  is  inserted  at  the  point  where  the  search  falls  off 
the  tree.  Let  p'  be  the  parent  (if  any)  of  the  newly  inserted  node.  We  now  retrace 
the  path  from  p'  to  the  root  performing  rebalancing  rotations. 

There  are  four  kinds  of  rotations  LL,  LR,  RL,  and  RR.  LL  and  RR  rotations  are 
symmetric  and  so  also  are  LR  and  RL  rotations.  The  typical  configuration  before  an 
LL  rotation  is  performed  is  given  in  Figure  3.4(a).  p'  denotes  the  root  of  a  subtree  in 
which  the  insertion  was  made.  Let  p  be  the  (size  of  the)  subtree  before  the  insertion. 
Then,  since  the  tree  was  a  y3-BBST  prior  to  the  insertion,  p-{p,d).  Also,  for  the  LL 
rotation  to  be  performed,  we  require  that  {q  >  c)  and  {q  >  d).  Note  that  q  >  d 
implies  q  >  1.  We  shall  see  that  0-{q,c)  follows  from  the  fact  that  the  insertion  is 
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LL 


q 


d 


q 


c 


c 


(a)  before 


(b)  after 


Figure  3.4.  LL  rotation  for  insertion 


made  into  a  /?-BBST  and  from  properties  of  the  rotation.  Following  an  LL  rotation, 


Lemma  5  [LL  insertion  lemma]  If  [p-{p,  d)  A  H^^c)  A{q  >  c)  A{q  >d)J  for  0  <  0  < 
1/2  before  the  rotation,  then  0-{q,gp')  and  ^-{c,d)  after  the  rotation. 


(a)  0{q-l)  <  c  (as  ^-{q,c))  <  gp'.  Also,  I3{gp'-l)  =  Pic  +  d)  <  (as  ^  >  0,  q  >  c 
and  9  >      <  9  (as  ^  <  1/2).  So,  H<]^9P')- 

(h)  d  <  q  ^  d-  \  <  q-l  ^  Hd-l)  <  l^iq-l)  <  cias  /?-(9,c)).  Also, 
^{c-l)<p{q  +  c-l)  =  fi{p'-2)=P{p-l)<d{as^-{p,d)).  So,  Hc,d).  □ 

In  an  LR  rotation,  the  before  configuration  is  as  in  Figure  3.4(a).  However,  this 
time  q  <c.  Figure  3.4(a)  is  redrawn  in  Figure  3.5(a).  In  this,  the  node  labeled  c  in 
Figure  3.4(a)  has  been  labeled  q  and  that  labeled  q  in  Figure  3.4(a)  has  been  labeled 


p'  is  updated  to  be  the  node  p". 


Proof    Assume  the  before  condition. 
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(a)  before  (b)  after  substep  (i) 

Figure  3.5.  Substep  (i)  of  insertion  LR  rotation 

a.  With  respect  to  the  labelings  of  Figure  3.5(a),  rotation  LR  is  applied  when 

[{q  >a)A{q>  d)]. 

The  other  conditions  that  apply  when  an  LR  rotation  is  performed  are 

[HP^d)Ap-{a,q)AHb,c)]. 

Here  p  denotes  the  (size  of  the)  left  subtree  of  gp  prior  to  the  insertion.  An  LR 
rotation  is  accomplished  in  two  substeps  (or  two  subrotations).  The  first  of  these  is 
shown  in  Figure  3.5(b).  Following  an  LR  rotation,  p'  is  updated  to  be  node  q'. 

Lemma  6  [LR  substep(i)  insertion  lemma]  If  [/3-{p,d)  A  0-{a,q)  A  0-{b,c)  A  {q  > 
a)A{q  >  d)JforO  <  ^  <  1/2  before  the  subrotation,  then  [3-{p",gp')A{{/3-{a,b)A^- 
{c,d))  V  {j^-{a,b)  A  0-{c,d))}J  after  the  subrotation. 
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Proof     Assume  the  before  condition.    First,  we  show  that  0-{p",gp')  after  the 
rotation.  Note  that  0{p"  -  1)  =  /?(a  +  6)  =  /?(a  +  6  +  c  +  1)  -  /?(c  +  1)  =  Hp'  "  1)  " 
I3{c+l)  =  I3{p~l)-^c<d-^c<d<  gp'.  Also,  0{gp'  -  I)  =  P{c  + d)  <  b  + 0  + 
(as  /?-(6,c))  <b  +  ^qiasq>d)<b  +  a  +  ^{8LS  0-{a,q))  <  p"  {as  ^  <  1/2  and 

p"  =  a  + 6  +  1).  So,  Hf^gp')- 

Next,  we  prove  two  properties  that  will  be  used  to  complete  the  proof. 
PI:  /3(6-l)  <a- 

To  see  this,  note  that  ^{h  -  I)  <  ^{q  -  I)  <  a  (as  $-{a,  q)). 
P2:  /?(c-  1)  <  d. 

For  this,  observe  that  p' -  1  =  a  +  9  >  /?(g  -  1)  +  9  (as  ^-(a,?))  =  (/3  +  1)(9  -  1)  +  1. 
So,  9  -  1  <  ^  =  Similarly,  g  -  1  =  6  +  c  >  ^(c  -  1)  +  c  (as  ^-(6,c)) 

=  (^  +  l)(c  -  1)  +  1.  So,  ^(c  -  1)  <  ^(9  -  2)  <  ^(9  -  1)  <  <  (as 

/9-(p,c/))<d. 

To  complete  the  proof  of  the  lemma,  we  need  to  show 

{(^-(a,  h)  A  ^-(c,  rf))  V  ( Y^-(«,  6)  A  ;S-(c,  d))}. 
We  do  this  by  considering  the  two  cases  b>  c  and  b  <  c. 

Case  b  >  c:  Since  a  <  q  =  b  +  c  +  I,  I3{a  -  1)  <  /3{b  +  c)  <  2^b  <  b.  This  and 
PI  imply  /?-(a,6).  Also,  d<q  =  b  +  c+l.   So,  ^(<f  -  1)  <  ^(6  +  c  -  1)  = 
+  ^(6  -  1)  <         +  ^  (as  0-{b,c))  =  c.  This,  together  with  P2  implies 
y|^-(c,d).  So,  Ha,b)A4-^-{c,d). 

Case  b  <  c:  Since  a  <  q  -  b  +  c l,a  -  I  <  6  +  c.   So,  a-1  <6  +  c-  lor 
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Also,(i-l  <g-2  =  6  +  c-l.  So,  ^{d  -  \)  <  0{b  +  c  -  I)  <  ^i2c  -  1)  <  c.  This, 
together  with  P2  implies  ^-(c,  tf).  So,  ft)  A  ^-(c,  d).  □ 

Since  an  LR(i)  rotation  can  cause  the  tree  to  lose  its  /^-balance  property,  it  is 
necessary  to  follow  this  with  another  rotation  that  restores  the  /9- balance  property. 
It  suffices  to  consider  the  two  cases  of  Figures  3.6  and  3.7  for  this  follow  up  rotation. 
The  remaining  cases  are  symmetric  to  these.  In  Figures  3.6  and  3.7,  p  and  d  denote 
the  nodes  that  do  not  satisfy  l3-{p,d).  Note,  however,  that  these  nodes  do  satisfy 

Since  the  follow  up  rotation  to  LR(i)  is  done  only  when 


^  -{p,d)A{-^HP,d)l 


1  +  ^ 


either  l3{p-\)  >  d  or  l3{d-\)  >  p.  When  /3(p  - 1)  >  rf,  the  second  substep  rotation  is 
one  of  the  two  given  in  Figures  3.6  and  3.7.  When  0{d-l)>  p,  rotations  symmetric 
to  these  are  performed.  In  the  following,  we  assume  0{p  -  I)  >  d.  Further,  we  may 
assume  (i  >  0,  as  J  =  0  and  :^-{p,d)  imply  p  <  1.  Hence,  /3-{p,d).  Also,  <i  >  0  and 
0{p  -I)  >  d  imply  p  >  1. 

The  LR(ii)  LL  rotation  is  done  when  the  condition 


A  =  {q>d)A{c<{l+/3)q  +  il-  0))  A  B  where 
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(a)  before  (b)  after 

Figure  3.6.  Case  LL  for  LR(ii)  rotation 


B  =  -^—{p,d)  A  (-;9-(p,rf))  Ay9-(9,c)  A  (^(p  -  1)  >(i  >  0). 

1   ~|~  P 

Lemma  1  [Case  LR(ii)  LL  rotation]  If  A  holds  before  the  rotation  of  Figure  3.6,  then 
l3-{q,gp')  and  ^-{c,d)  after  the  rotation  provided  0  <  ^  < 

Proof    {a.)  H<1^9P'): 

P{q-l)  <  c  (as  ^-(9,c))  <  gp'.  Also,  fi{gp'-l)  =  0{c+d)  <  < 
^(1  +  0)q  +  ^(1  _  ^)  +  ^{q  -  1)  (as  9  ><f)  =  /?(2  +        -      <  q  (as  /?(2  +  ^)  <  1 

for  0  <  /3  <  ^  -  1).  So,  H<1^9P')- 
(b)  /?-(c,(i): 

^(d  -  1)  </?(?-  1)  <  c  (as  /9-(9,  c)).  And,  ^(c  -  1)  =  ^^{0  -  1)  +  y|^(c  -  1)  < 
4p<i  +  t|^(^  -  1)  =  1^(9  +  -  -  1)  =  -  2)<  t|^(p  -  1)  <  ^  (as  i|^-(p,d)). 

So,^-(c,(f).  ° 


Lemma  8  If  (c  <  (1  +  +  (1 
provided  Q  <  /3  <  V^-  1. 


-  d))  A  (/3(p  -  1)  >  <f)  in  Figure  3.6,  then  d  <  q 
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Proof  Sinced  <  0{p-l)  =  ^{q  +  c)  <  0{qHl+^)q  +  l-0)  =  ^(^+2)9  +  ^(1-^)  < 
9  +  1  (as        +  2)  <  1  and        -  /S)  <  1  for  0  <  ^  <  ^2  -  1).  So,  d  <  q.  □ 

So,  the  only  time  an  LR(ii)  LL  rotation  is  not  done  is  when  C  =  (Ci  V  C2)  A  B 
holds  where 

C,  =  {q  =  d)/\{c<{l  +  /3)q  +  l-  0) 
C2  =  c>(l  +  ^)9  +  (l-^)- 

At  this  time,  the  LR  rotation  of  Figure  3.7  is  done.  In  terms  of  the  notation  of 
Figure  3.7,  the  condition  C  becomes  D  =  (Di  V  D2)  A  £  where 

Dr  =  {a  =  d)A{q<{l  +  l3)a  +  l  -0) 

=  9  >  (1  + /3)a  +  1  - /? 
E  =  Y^-(P,  d)  A  -^-(p,  d)  A  H<^,  q)  A         c)  A  (^(p  -  1)  ></  >  0). 

Lemma  9  When  an  LR(ii)  LR  rotation  is  performed  and  ^  <  \/2  -  \,  q  >  d  and  so 
search  cost  is  reduced. 

Proof  If  Du  then  since  d  <  ^{p  -  I)  =  /S{a  +  q)  =  I3{d  q),  q  >  d/ 0  -  d  >  d  as 
/3  <  v/2  -  1.  If  P2,  then  d  <  0{p-l)  ^  0{a  +  q)  <  +q)  =  ^q  -  ^  < 

^9<9(a^^<v/2-l).  ° 
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(a)  before  (b)  after 

Figure  3.7.  Case  LR  for  LR(ii)  rotation 

Lemma  10  When  {d  =  a)  A /3-{b,c)  A  {^{p  -  1)  >d)  A  {/3  <  V2  -  I)  (see  Figure  3.7), 
/3{a-l)  <  b  and  0{d  -  I)  <  c. 

Proof    Since  ^{p  -  1)  >  d  and  d  =  aj{p  -  I)  >  a  or  ^{a-\- q)  >  a  or  a(l  -  < 
ora<^,q.  So,  ^(a-l)<^g-^=^(6  +  c+l)-^. 
If  c  <  I  +  /3,  then 

~      1-^  I-^  ^ 

/3(;g+l)6  + 

1-/9 

<    h  (as        +  1)  <  1  -  /9  for  /9  <  v/2  -  1  and  ^2  +      -  1  <  0  for     <  v/2  -  1). 
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Since/3(c-l)<6,c<|  +  l.  So, 


^(a-l)<^(6  +  c+l)<^(6+-+2)<^^y^  +  -^. 


So, 


+  1  3^-1 
a  —  I  <  z  + 


1-/?  1-0 

However,  since  /J^  +  2^  -  1  <  0  for  ^  <  x/2  -  1,  (1  +  ^)/(l  -  ^)  <  J  and 
(3^  -  1)/(1  -  0)<I3.  So,  a  - 1<  b/l3  +  13.  If  a  >  c  +  1,  then  c<a-l<b/0  +  (3. 
We  have  already  shown  that  for  c  <  b/ 0  +  0,  ^{0  -  I)  <  b.  So,  assume  a  <  c+l. 
Now,  a  <  c  and  ^(a  -  1)  <  ^(c  -  1)  <  6  (a^  ^-(6,c)).  So,  ^(a  -  1)  <  6  in  all  ca^es. 
/3(a  -  1)  <  c  may  be  shown  in  a  similar  way.  Since  a  =  d,  v/e  get  ^{d  -  1)  <  c.  □ 


Lemma  11  [Case  LR(ii)  LR  rotation]  If  D  holds  before  the  rotation  of  Figure  3.7, 
then  l3-{p',gp'),/3-{a,b),  and  0-{c,d)  following  the  rotation  provided  0  <  ^  <  \/2  -  1. 

Proof    (a)  ^-(p',5P'): 

P[gp'  -  I)  =  0{c  + d)  <  b+  /3  +  /3d  {as  ^-(fe,c))  <  b  +  0  +  0q  (from  Lemmas  9 
&nd  10,  q>d)  <b  +  p  +  a  +  0  =  a  +  b  + 2/3  <a  +  b+l=p'.  Also,  since  y|^-(p,  d) 
and  q  >  d,l3{p  -  1)  <  (y3  +  l)d  or  /3{a  +  q)  <  +  l)d  or  a  +  q  <  {I  +  ^)d  or 
a<{i  +  l)d-q<{\  +  ^)d-d^d/l3.  So,  ^(p'-l)  =  ^(a  +  6)<(i  +  /36<d  +  c  +  ^ 
(as  /S-(6,c))  <  (f+c  +  1  =  V- 
(b)  /5-(a,6): 

Since  6  <  9  and  B-{a,q),l3{b-  I)  <  0{q  -  I)  <  a. 
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When  Di,  0{a  -  I)  <  b  was  proved  in  Lemma  10.  So,  ^-{a,b). 
When  I>2,  q>a{\  +  0)  +  l-0.  So, 


''-1  +  ^     1  +  /?"    1+^  l+P' 


So, 


+    +  ^    1-/9     .^^b  +  b  +  2fi_i^  ^^^^ 


So,  ^-(a,6). 

(c)  Hc,dy. 

Note  that  ^(c  -  1)  <  ^{q  -  1)  <  t|^(9  -  1)  <  ifeb  "  1)  <  ^• 
When  Di,  0{d  -  1)  <  c  was  proved  in  Lemma  10.  So,  /3-{c,d). 

When  D2,  if  d  <  6  +  1,  then  <i  <  6  and  1)  <  I3{b-l)  <  c.  So,  assume  cf  >  6+  1. 

Now,  6  <  (i  -  1  <  /5(p  -  1)  -  1.  So, 


6   <  ^(a  +  6  +  c+l)-l 

<  ^(^i^  +  ^  +  c+l)-l 

=  ^(6  +  c  +  ^  +  (l  +  ;9)(6  +  c  +  l))-l 

<  Y^(^  +  l  +  c  +  /3  +  (l+;9)(^  +  l+c+l))-l 

=  c  +  /3+(l+^)c+2^-l 

=  (2  +  /3)c  +  3;9-l<(2  +  ^)c+/3(as^<v/2-l) 
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Also,  from  d  <  ^{p  -  1)  and  the  above  derivation,  we  get 


d   <    Y^(6  +  c  +  /?  +  (l  +  ^)(6  +  c+l)) 
=   {2  +  l3)c+  

<    (2  +  /3)c+l  (as^3  +  4i92  +  y9<  l+y5fory9<  v/2-1). 


So,  ^{d  -1)<  13(2  + fi)c<c  {as  13  <y/2-l).  So,  i9-(c,  d). 


Theorem  9  If  T  is  0 -balanced,  0  <  /9  <  v/2  -  1,  prior  to  insertion,  it  is  so  following 
the  insertion. 

Proof  First  note  that  since  all  binary  search  trees  are  balanced  for  ^  =  0,  the  rota- 
tions (while  unnecessary)  preserve  0-balance.  So,  assume  ^  >  0.  Consider  the  tree  T' 
just  after  the  new  element  has  been  inserted  but  before  the  backward  restructuring 
pass  begins. 

If  the  newly  inserted  node,  2,  has  no  parent  in  T',  then  T  was  empty  and  T'  is 
;9-balanced.  If  2  has  a  parent  but  no  grandparent,  then  T  has  at  most  one  nonempty 
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subtree  X.  Since  T  is  ^-balanced,  ^(l^l  -  1)  <  0.  So,  |X|  <  1.  Following  the 
insertion,  T  has  one  subtree  with  <  1  nodes  and  one  with  exactly  one.  So,  T  is 
^-balanced.  We  may  therefore  assume  that  2  has  a  grandparent  in  V . 

From  the  downward  insertion  path,  it  follows  that  all  nodes  u  in  T  that  have 
children  /  and  r  for  which  -i9-(/,r)  must  lie  on  the  path  from  the  root  to  z.  During 
the  backward  restructuring  pass,  each  node  on  this  path  (other  than  z  and  its  parent) 
play  the  role  of  gp  in  Figures  3.4  and  3.5.  The  ^-property  cannot  be  violated  at  z 
as  2  has  no  children.  It  cannot  be  violated  at  the  parent,  5,  of  2  as  s  satisfied  the 
^-property  prior  to  insertion.  As  a  result  its  other  subtree  has  <  1  element.  So, 
following  the  insertion,  s  satisfies  the  ^-property.  As  a  result,  each  node  in  V  that 
might  possibly  violate  the  ^-property  becomes  the  gp  node  during  the  restructuring 
pass.  Consider  one  such  gp  node.  It  has  children  in  T  denoted  by  p'  and  d.  Its 
children  in  T  are  p  and  d.  Figures  3.4  and  3.5  show  the  case  when  d  is  the  right 
subtree  of  gp  in  both  T  and  T.  The  cases  RR  and  RL  arise  when  d  is  the  left 
subtree. 

During  the  restructuring  pass,  gp  begins  at  the  grandparent  of  z  and  moves  up 
to  the  root  of  T .  If  z  is  at  level  r  in  T',  (the  root  being  at  level  1),  then  gp  takes  on 
r  -  2  values  during  the  restructuring  pass.  We  shall  show  that  at  each  of  these  r  -  2 
positions  either 

(a)  no  rotation  is  performed  and  all  descendants  of  gp  satisfy  the  ^-property  or 

(b)  a  rotation  is  performed  and  following  this,  all  descendants  of  node  p"  (Figure  3.4) 
or  of  node  q'  (Figure  3.5)  satisfy  the  /3-property. 
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As  a  result,  following  the  rotation  (if  any)  performed  when  gp  becomes  the  root 
of  r,  the  restructured  tree  is  ^-balanced.  The  proof  is  by  induction  on  r.  When 
r  =  3  (recall,  we  assume  ^  has  a  grandparent),  gp  begins  at  the  root  of  T'  and  its 
descendants  satisfy  the  yS-property. 

Without  loss  of  generality,  assume  that  the  insertion  took  place  in  the  left  subtree 
of  gp.  With  respect  to  Figure  3.4,  we  have  three  cases:  (i)  g  >  c  and  9  ></,  (ii)  q<c 
and  Od,  and  (iii)  9  <  (i  and  c  <  <i.  In  case  (i),  all  conditions  for  an  LL  rotation  hold 
and  such  a  rotation  is  performed.  In  case  (ii),  an  LR  rotation  is  performed.  Following 
either  rotation,  T'  is  /3-balanced.  In  case  (iii),  /3(p'  -  1)  =  /?(?  +  c)  <  2/3d  <  d  (as 
/9  <  v/2  -  1).  Also,  p{d-l)<p<p+l=p'-  So,  ^{d  -  1)  <  p'.  Hence,  /3-(p',(i) 
and  T'  is  ^-balanced. 

For  the  induction  hypothesis,  assume  (a)  and  (b)  whenever  r  <  k.  In  the 
induction  step,  we  show  (a)  and  (b)  for  trees  T  with  r  =  k  +  I.  The  subtree  in 
which  the  insertion  is  done  has  r  =  ^.  So,  (a)  and  (b)  hold  for  all  gp  locations  in  the 
subtree.  We  need  to  show  (a)  and  (b)  only  when  gp  is  at  the  root  of  T'.  This  follows 
from  Lemmas  5,  6,  7,  and  11. 

The  theorem  now  follows.  ^ 


Lemma  12  The  time  needed  to  do  an  insertion  in  an  n  node  j3-BBST  is  0(\ogn) 
provided  0  <  ^3  <       —  1  • 
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Proof  Follows  from  the  fact  that  insertion  takes  0(h)  time  where  h  is  the  tree 
height  and  h  =  O(logn)  when  /3  >  0  (Lemmas  1  and  3).  O 

3.4.3  Deletion 

To  delete  element  x  from  a  ^-BBST,  we  first  use  the  unbalanced  binary  search 
tree  deletion  algorithm  of  Horowitz  and  Sahni  [14]  to  delete  x  and  then  perform  a 
series  of  rebalancing  rotations.  The  steps  are: 

Step  1  [Locate  x]  Search  the  /3-BBST  for  the  node  y  that  contains  x.  If  there  is  no 
such  node,  terminate. 

Step  2  [Delete  x]  If  y  is  a  leaf,  set  d'  to  nil,  gp  to  the  parent  of  y,  and  delete  node 
y.  If  y  has  exactly  one  child,  set  d!  to  be  this  child;  change  the  pointer  from 
the  parent  (if  any)  of  y  to  point  to  the  child  of  y;  delete  node  y;  set  gp  to  be 
the  parent  of  d' .  If  y  has  two  children,  find  the  node  z  in  the  left  subtree  of  y 
that  has  largest  value;  move  this  value  into  node  y;  set  y  =  2;  go  to  the  start 
of  Step  2.  {  note  that  the  new  y  has  either  0  or  1  child  } 

Step  3  [Rebalance]  Retrace  the  path  from  <f'  to  the  root  performing  rebalancing 
rotations. 

There  are  four  rebalancing  rotations  LL,  LR,  RR,  and  RL.  Since  LL  and  RR 
as  well  as  LR  and  RL  are  symmetric  rotations,  we  describe  LL  cind  LR  only.  The 
discussion  is  very  similar  to  the  case  of  insertion.  The  differences  in  proofs  are  due 
to  the  fact  that  a  deletion  reduces  the  size  of  encountered  subtrees  by  1  while  an 
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d' 


LL 


q 


d' 


c 


c 


q 


(a)  before 


(b)  after 


Figure  3.8.  LL  rotation  for  deletion 


insertion  increases  it  by  1.  In  an  LL  rotation,  the  configuration  just  before  and  i 
the  rotation  is  shown  in  Figure  3.8.  This  rotation  is  performed  when  q  >  c 
q>d'.  Following  the  rotation,  d'  is  updated  to  the  node  p'. 

Let  d  denote  the  size  of  the  right  subtree  of  gp  before  the  deletion.  So,  d  =  d 
Since  prior  to  the  deletion  the  /3-BBST  was  ^-balanced,  it  foUows  that  HP^d) 


Lemma  IS  [LL  deletion  lemma]  If  [l3-{p,  d)  A  /^-(g,  c)  h{q>  c)  ^{q  >  d)  ^  (1/3  <  ^  < 
1/2)7  before  the  rotation,  then  [^-{q,gp')  A  ^-[c,d')]  after  the  rotation. 

Proof  (a)y3-(g,V): 

P{q  -  1)  <  c  (as  Hq^c))  <  gp'.  Also,  ^{gp'  -  1)  =  /5(c  +  d')  <  20q  {as  c  <  q  and 
d'  <q)<q{as0<  1/2).  So,  Hq^9P')- 
(b)  Hc,d'): 

d'  <q^d'-l<q-l-^  P{d'-l)  <l3{q-l)<c.  Also,  when  c  <  1,  y9(c-l)  <  0  <  J' 
(as  d'  >  0).  When  c  >  l,q  >  c^q>2a.ndp  =  q  +  c+l  >c  +  3.  So, 
I3{c-l)  <  0{p-l)-Sl3  <  d-3f3  {as  Hp,  d))  <  d-\  {as  13  >\/3)  =  d' .  Hence, 


^-(9,c). 
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In  an  LR  rotation,  the  before  configuration  is  as  in  Figure  3.8(a).  However,  this 
time  q  <  c.  Figure  3.8(a)  is  redrawn  in  Figure  3.9(a).  In  this,  the  node  labeled  c 
in  Figure  3.8(a)  has  been  relabeled  q  and  that  labeled  q  in  Figure  3.8(a)  has  been 
relabeled  a.  With  respect  to  the  labelings  of  Figure  3.9(a),  rotation  LR  is  applied 
when 

[{q  >a)A{q>  d% 
The  other  conditions  that  apply  when  an  LR  rotation  is  performed  are 

[HP^d)A0-{a,q)AHb,c)]. 

Here  d  denotes  the  (size  of)  right  subtree  of  gp  prior  to  the  deletion.  As  in  the  case 
of  insertion,  an  LR  rotation  is  accomplished  in  two  substeps  (or  two  subrotations). 
The  first  of  these  is  shown  in  Figure  3.9.  Following  an  LR  rotation,  d'  is  updated  to 
node  q'. 

Lemma  U  [LR  substep(i)  deletion  lemma]  If  [l3-{p,  d)  A  ^-(a,  q)  A  ^-(6,  c)  A  (9  >  a)  A 
{q  >  d')]  before  the  subrotation  LR(i),  then  [l3-{p\gp')A{il3-ia,b)AY^-{c,d'))y{j^- 
(a,6)  A  l3-{c,d'))}]  after  the  subrotation  provided  1/3  <  0  <  1/2  . 

Proof    Assume  the  before  condition. 

(a)  If  6  =  c  =  0,  then  q  =  6  +  c  +  1  =  1.  Furthermore,  {q  >  a)  and  {q  >  d')  imply 
a  =  d'  =  0.  So,  gp'=p'^l.  Hence,  [l-{p\gp')  A  \-{a,b)  A  \-{c,d')] 
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(a)  before  (b)  ^^er  substep  (i) 


Figure  3.9.  LR  rotation  for  deletion 

(b)  If  fe  =  1  and  c  =  0,  then  9  =  2,  a  <  1,  and  d'  <  1.  So,  1  <  p'  <  3  and  1  <  5//  <  2. 
Hence,  [|-(p',5P')  A  A  H^,  ^')] 

(c)  If  6  =  0  and  c  =  1,  then  9  =  2,  a  <  1,  and  tf'  <  1.  So,  1  <  p'  <  2  and  1  <  gp'  <  3. 
Hence,  [|-(p',5p')  A  \-{a,b)  A  \-{c,d')\ 

As  a  result  of  (a)  -  (c),  to  complete  the  proof,  we  may  assume  that  6  >  1  and  c  >  1. 
So,  9  >  3,  a  >  1  (as  /3-(a,  q)  ^  ^{q- \)  <  a  or  a  >2^  >  (i),  p  =  a  +  \  >  b,d>2 
(35  /?-(p,(f)  ^^{p-\)<d  and  ^  >  1/3),  and  J'  =  (i  -  1  >  1. 

First,  we  show  that  Hp'^9P')-  For  this,  note  that  a  +  fe  +  c+  l=  p-l.  From 
^-(p,  d),  it  follows  that  I3{a  +  b  +  c+l)  =  I3{p-l)  <d.  So,  fi{a  +  b)<d-^c- 
From  Figure  3.9(b),  we  see  that /?(p' -  1)  =  ^(a  +  i).  Hence,  ^(p' -  1)  <d-^c-l3  = 
d'  -  I3c+1-  ^  <  d'  +  1  -  2/3  <  gp'.  Also, 


fi{gp' -  \)  =  Pic  +  d')    <    b  +  13  +  0d'  (as  Hb,c)) 

<    b  +  l3q  +  l3{asq>d') 
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<   b  +  a  +  20  {as  ^-{a,q)) 


<  P- 


So,  Hp\9P')- 

Next,  we  prove  two  properties  that  will  be  used  to  complete  the  proof. 
PI:  ^{b-l)<a. 

To  see  this,  note  that  ^{b  -  1)  <  ^(9  -  1)  <  a  (as  /3-(a,  q)). 
P2:  ^(c-1)  <<i'. 

For  this,  observe  that  /?(c  -  1)  <       -  2)  (as  c  <  9  -  1)  <  ^(p  -  4)  (as  q  =  p  -  a  -  1 
and  a  >  1)  =  ^(p  -  1)  -  3y9  <    -  1  (as  /9-(p,</)  and  13  >  1/3)  =  d'. 
To  complete  the  proof  of  the  lemma,  we  need  to  show 


{{Ha,  b)  A  ^-(c,  d'))  V  ( Y^-(a,  b)  A  ^-(c,  (f))}. 


For  this,  consider  the  two  cases  b>  c  and  6  <  c  (as  in  Lemma  6). 

Case  b>c:  Since  a  <  q  =  b  +  c  +  lj{a  -  I)  <  P{b  +  c)  <  2/3b  <  b.  This,  together 

with  PI  implies  ^-(a,  b).  Also,  (f'  <  9  =  6  +  c  +  1.  So,  ^(<f'  -  1)  <  ^(6  +  c  -  1)  = 

jflfl^  +  ^(^  -  1)  +  ^  =  ^-  ™s,  together  with  P2  implies  j^-{c,  d').  So, 

/3-(a,6)A4^-(c,^'). 

Case  6  <  c:  Since  a  <  q  =  b  +  c  -\-  \,a  -  I  <  b  c.  So,  a-l<6  +  c-  l 
or^<I?^  +  ^<I^  +  A=^-  This  and  PI  imply  ^-(a,6).  Also, 
^'  _  1  <  ^  _2  =  6  +  c-  1.  So,  ^((/'  -  1)  <  /3(fe  +  c-  1)  <  /9(2c-  1)  <  c.  This  and 
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P2  imply  Hc,d').  Hence,  ^^^-(0,6)  A  ^-(c,(f') 


The  substep(ii)  rotations  are  the  same  as  for  insertion. 

Theorem  10  IfT  is  ^-balanced,  then  following  a  deletion  the  resulting  tree  T  is  also 
^-balanced  provided  1/3  <  /3  <       -  1. 

Proof    Similar  to  that  of  Theorem  9.  ° 

When  0  <  /?  <  1/3,  we  need  to  augment  the  LL  rotation  by  a  transformation 
for  the  case  d'  =  0.  When  d'  =  ^{p  -  I)  <  d  =  d'  +  \  =  \.  So,  p  <  1/^  +  1  and 
=  p  4-  d'  +  1  <  l/;9  +  2.  To  /3-balance  at  gp,  the  at  most  Ij^  +  2  nodes  in  gp 
are  rearranged  into  any  ^-BBST  in  constant  time  (as  1//?  +  2  is  a  constant).  When 
d'  >  0,  the  proof  of  Lemma  13  part  (b)  can  be  changed  to  show  /3(c  -  1)  <  <i'  for 
0  <  <  \/2  -  1-  The  new  proof  is:  since  c  <  q,c  <  {p  -  l)/2  and  ^(c  -  1)  < 
I3{p -l)/2  -  0  <  d/2  -  ^  =  d-  d/2  -  ^  <  d-1  -  ^  <  d'.  The  LR  rotation  needs 
to  be  augmented  by  a  transformation  for  the  case  d'  =  d  -  1  <  0^2+0)  -  1-  At  this 
time,  ^(p  -  1)  <    <  So,  5P  =  P  +  d  <  p^TT)  +  ^  +  W^V       /^'balance  at 

gp,  we  rearrange  the  fewer  than  p^jj^^  +  1  +  ;g(^  nodes  in  the  subtree,  in  constant 
time,  into  any  ^-balanced  tree.  When  d'  >  -  1,  the  proof  for  0{c  -  1)  <  d'  in 

Lemma  14  needs  to  be  changed  to  show  that  the  LR  substep(i)  lemma  holds.  The 
new  proof  is: 


d   >   /3(p-l)  =  /?(a  +  6  +  c+l)>/3(/3(«7-l)  +  6  +  c+l) 
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=   0{/3{b+c)  +  b  +  c+l) 

>   ^((l  +  ^)^(c-l)  +  (l  +  ^)c+l) 

=   |9((l  +  ^)'(c-l)  +  2  +  /?). 

So,  ^(c  -  1)  <  <d-l{^d>  ^)  =  cf. 

Also,  note  that  when  /3  =  0,  all  trees  are  /^-balanced  so  the  rotations  (while  not 

needed)  preserve  bcilance. 

Theorem  11  With  the  special  handling  of  the  case  d'  =  0,  the  tree  V  resulting  from 
a  deletion  in  a  p-BBST  is  also  ^-balanced  for  0  <  ^  <  y/2  -  I. 

Lemma  15  The  time  needed  to  delete  an  element  from  an  n  node  ^-BBST  is  O(\ogn) 
provided  0  <  ^9  <       —  1  • 

3.4.4  Enhancements 

Since  our  objective  is  to  create  search  trees  with  minimum  search  cost,  the 
rebalancing  rotations  may  be  performed  at  each  positioning  oigp  during  the  backward 
restructuring  pass  so  long  as  the  conditions  for  the  rotation  apply  rather  than  only 
at  gp  positions  where  the  tree  is  unbalanced. 

Consider  Figure  3.4(a).  If  p'  <  d,  then  the  conditions  of  Lemmas  5  and  6  cannot 
apply  asq<p'  <d.  However,  it  is  possible  that  e  >  p'  where  e  is  the  size  of  either  the 
left  or  right  subtree  of  d.  In  this  case,  an  RR  or  RL  rotation  would  reduce  the  total 
search  cost.  The  proofs  of  Lemmas  5  and  6  are  easily  extended  to  show  that  these 
rotations  would  preserve  balance  even  though  no  insertion  was  done  in  the  subtree 
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d.  The  same  observation  applies  to  deletion.  Hence  the  backward  restructuring  pass 
for  the  insert  and  delete  operations  can  determine  the  need  for  a  rotation  at  each  gp 
location  as  below  (/  and  r  are,  respectively,  the  left  and  right  children  of  gp). 
if  s{l)  >  s{r)  then  check  conditions  for  an  LL  cind  LR  rotation 
else  check  conditions  for  an  RR  and  RL  rotation. 
The  enhanced  restructuring  procedure  used  for  insertion  and  deletion  is  given  in 
Figure  3.10.  In  the  RR  and  RL  cases,  we  have  used  the  relation  '>'  rather  than  '>' 
as  this  results  in  better  observed  run  time. 

Since  it  can  be  shown  that  the  rotations  preserve  balance  even  when  there 
has  been  no  insert  or  delete,  we  may  check  the  rotation  conditions  during  a  search 
operation  and  perform  rotations  when  these  improve  total  search  cost. 

Finally,  we  note  that  it  is  possible  to  use  other  definitions  of  ;3-balance.  For 
example,  we  could  require  ^{s{a)  -  2)  <  s{b)  and  /3{s{b)  -  2)  <  s{a)  for  ^-(a,  b). 
One  can  show  that  the  development  of  this  chapter  applies  to  these  modifications 
also.  Furthermore,  when  this  new  definition  is  used,  the  number  of  comparisons  in 
the  second  substep  of  the  LR  and  RL  rotations  is  reduced  by  one. 

3.4.5    Top  Down  Algorithms 

As  in  the  case  of  red/black  and  WB(a)  trees,  it  is  possible  to  perform,  in  0(log  n) 
time,  inserts  and  deletes  using  a  single  top  to  bottom  pass.  The  algorithms  are  similar 
to  those  already  presented. 
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procedure  Restructuring  ; 
begin 

while  (gp)  do 
begin 

if  {s{gp.left)  >  s{gp.right))  then 

begin  {check  conditions  for  an  LL  and  LR  rotation} 
p  =  gp.left  ; 

if  {s{p.left)  >  s{p.right))  then 

begin  if  {s{p.left)  >  s{gp.right))  then  do  LL  rotation;  end 
else 

begin 

if  {s{p.right)  >  s{gp.right))  then  {LR} 
begin 

do  LR  rotation  ; 

{  now  notations  a,6,c,  and  d  follow  from  figure  3.1(b)  } 
if  {I3{s{a)-l)  >  sib))  then 

if  {{s{a.right)  <  (1  +  I3)s{a.left)  +  1  -  /?)  and 
(5(6)  <  s{a.left)))  then 
do  LL  rotation 
else  do  LR  rotation 
else  if  {^{s{d)-\)  >  s{c))  then 

if  {{sid.left)  <  (1  +  0)s{d.right)  +  1-/3)  and 
(5(c)  <  s{d.right)))  then 
do  RR  rotation 
else  do  RL  rotation  ; 

end 

end 

end 

else  {check  conditions  for  an  RR  and  RL  rotation} 
begin 

p  =  gp.right  ; 

if  (sip.left)  >  s{p.right))  then 
begin 

if  {sip.left)  >  s{gp.lefi))  then  {RL} 
do  symmetric  to  the  above  LR  case  ; 

end 

else 

begin  if  {s{p.right)  >  s{gp.left))  then  do  RR  rotation;  end  ; 
end  ; 
gp  =  gp.parent  ; 
end  ; 
end  ; 

Figure  3.10.  Restructuring  procedure 
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3.5    Simple /9-BBSTs 

The  development  of  Section  3.4  was  motivated  by  our  desire  to  construct  trees 
with  minimal  search  cost.  If  instead,  we  desire  only  logarithmic  performance  per 
operation,  we  may  simpHfy  the  restructuring  pass  so  that  rotations  are  performed 
only  at  nodes  where  the  ^-balance  property  is  violated.  In  this  case,  we  may  dispense 
with  the  LL/RR  rotations  and  the  first  substep  of  an  LR/RL  rotation.  Only  LR/RL 
substep  (ii)  rotations  are  needed.  To  see  this,  observe  that  Lemmas  7  and  11  show  that 
the  second  substep  rotations  rebalance  at  gp  (see  Figures  3.6  and  3.7)  provided 
(p,  d)  (The  remaining  conditions  are  ensured  by  the  bottom-up  nature  of  restructuring 
and  the  fact  the  tree  was  ^-balanced  prior  to  the  insert  or  delete). 

If  the  operation  that  resulted  in  loss  of  balance  at  gp  was  an  insert,  then  ^(p  - 
2)  <  d  {as  p  >  d,  the  insert  took  place  in  subtree  p  and  gp  was  /3-balanced  prior 
to  the  insert)  and  /?(p  -  I)  >  d  [gp  is  not  ^-balanced  following  the  insert).  For  the 
substep  (ii)  rotation  to  restore  balance,  we  need  /3(p  -  1)  <  (1  +  0)d.  This  is  assured 
'dd  + 13  <{l3+\)d  (as  ^(p  -  2)  <  d).  So,  we  need  d>l.Ud<l,  then  d  =  0.  Now 
/3(p  -  2)  <  d  and  /3(p  -  1)  ><f  imply  p  =  2.  One  may  verify  that  when  p  =  2,  the 
LR(ii)  rotations  restore  balance. 

If  the  loss  of  /3-balance  at  gp  is  the  result  of  a  deletion  (say  from  its  right  subtree), 
then  ^(p  -  \)  <  d  +  1  {as  gp  was  /3-balanced  prior  to  the  delete).  For  the  substep 
(ii)  rotation  to  accomplish  the  rebalancing,  we  need  y9(p  -  1)  <  {p  +  l)d.  This  is 
guaranteed  if  <f  +  1  <  (^-|-  l)<i  or  >  1//3.  When  d  <  \//3  and  ^  >  1/3,  d  <  2.  Since 
I3{p  -  I)  <  d  +  1  and  13  >  1/3,  when     =  2,  p  <  10;  when  J  =  1,  p  <  7;  and  when 
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d  =  0,  p  <  i.  We  may  verify  that  for  all  these  cases,  the  LR(ii)  rotations  restore 
balance.  Hence,  the  only  problematic  case  is  when  /9  <  1/3  and  <i  < 

When  ^  <  1/3,  an  LL  rotation  fails  to  restore  balance  only  when  d  =  0  {see 
discussion  following  Theorem  10).  So  we  need  to  rearrange  the  at  most  1/0  +  2 
nodes  in  gp  into  any  /3-balanced  tree  when  d  =  0.  An  LR  rotation  fails  only  when 

d  <  _J  1    To  see  this,  note  that  in  the  terminology  of  Lemma  14,  d  is  d'. 

The  proof  of  P2  is  extended  to  the  case  13  <  1/3  when  d'  >  -^^^^  -  1.  Also,  since 
d'  <  1/13,  for  the  case  6  >  c,  we  get  0{d'  -  1)  <  1  -  ^  <  c  (as  c  >  1).  For  the 
case  6  <  c,  we  need  to  show  l3{a  -  I)  <  b.  Since  an  LR  rotation  is  done  only  when 
condition  Dl  V  D2  holds,  from  Lemmas  10  and  11,  it  follows  that  0{a  -  I)  <  b.  So, 
an  LR  rotation  rebalances  when  ^  <  1/3  provided  d  >  j^^^  -  1.  For  smaller  d,  the 
at  most  +  +  1  nodes  in  the  subtree  gp  may  be  directly  rearranged  into 

a  ;9-balanced  tree. 

The  restructuring  algorithm  for  simple  ^-BBSTs  is  given  in  Figures  3.11  and  3.12. 
The  algorithm  of  Figure  3.11  is  used  following  an  insert  and  that  of  Figure  3.12  after 
a  delete. 

Simple  /3-BBSTs  are  expected  to  have  higher  search  cost  than  the  ^-BBSTs 
of  Section  3.4.  However,  they  are  a  good  alternative  to  traditional  WB(a)  trees  as 
they  are  expected  to  be  "better  balanced".  To  see  this,  note  that  from  the  proof  of 
Lemma  3,  the  balance,  B{p),  at  any  node  p  in  a  /3-baIanced  tree  satisfies 


1       ^    j_^5(r)  +  l 


B{p)  5(0  +  1 


procedure  Restructuring2  ; 
begin 

while  {gp)  do 
begin 

if  {^{s{gp.left)  -  1)  >  s{gp.right))  then  {do  an  LL  or  LR  rotation) 
begin 

P  =  9P-^eft  ; 

if  {{s{p.right)  <  (1  +  0)s{p.left)  +  1  -  ^)  and 

{s{gp.right)  <  s{p.left)))  then 

do  LL  rotation 
else  do  LR  rotation  ; 
end 
else 

do  symmetric  to  the  above  L  case  ; 
gp  =  gp.parent  ; 
end  ; 
end  ; 

Figure  3.11.  Simple  restructuring  procedure  for  insertion 

procedure  Restructurings  ; 
begin 

while  {gp)  do 
begin 

if  i^sigp.left)  -  1)  >  sigp.right))  then 

if      <  1/3)  and  (sigp.right)  <  l//?(2  +  ^)  -  1)  then 

rearrange  the  subtree  rooted  at  gp  into  any  /3- balanced  tree 
else  {do  an  LL  or  LR  rotation} 

begin 

P  =  gp-^^fi ; 

if  {{sip.right)  <  (1  +  ^)s{p.left)  +  1  -  /S)  and 

[sigp.right)  <  sip.left)))  then 

do  LL  rotation 
else  do  LR  rotation  ; 
end 

end 

else 

do  symmetric  to  the  above  L  case  ; 
gp  =  gp.parent  ; 
end  ; 
end  ; 

Figure  3.12.  Simple  restructuring  procedure  for  deletion 
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1 

+  ;3(,(r)+l) 


1  +  ^  + 


2/3-1 


T  /?(5(r)4-l) 
P  "f  /3(»(r)+l) 


So, 

B(p)  <  1  -  .    ,    1    /    2/3-1  • 
i  +  /3  +  /3(*(r)+l) 

Also,  since  s{r)  -  1  <  5(/)//3,  ^(r)  +  1  <  s{l)/^  +  2.  Hence,  1  +  ^  <  1  +  + 
So, 


,(/)+r 


^l^')  ^  Tin — __L_  +  -j- 


1  1  1  I   2/?-l  • 

i  +  /3  +  /3(»(/)+l) 


Consequently, 


,     ^  /  <  B{P)  <  1  -  ,  ,  1  /  2/3-1 

1  +  ^  +  ^WO+T)  ^  +  /?  +  ^(3(r)+l) 


When/?  =  V2-  1, 

1  1 

 — — ^  <  B{p)  <  1  -  ^ 

If  s(p)  <  10,  0.296  <  B(p)  <  1  -  0.296.  So,  every  /3-balanced  subtree  with  10  or 
fewer  nodes  is  in  WB(a)  for  a  «  0.296.  Similarly,  every  subtree  with  100  or  fewer 
nodes  is  in  WB(a)  for  a  «  0.293.  In  fact,  for  every  fixed  fc,  subtrees  of  size  k  or  less 
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procedure  Restructuring4  ; 
begin 

while  (gp)  do 
begin 

if  {s{gp.left)  >  s{gp.right))  then 

begin  {check  conditions  for  an  LL  and  LR  rotation} 
p  =  gp.left  ; 

if  is{p.left)  >  s{p.right))  and  {s{p.left)  >  s{gp.right))  then 
do  LL  rotation 

else  if  (sip.left)  <  s{p.right))  and  {s{p.right)  >  s{gp.right))  then 
do  LR  rotation  ; 

end 

else  {check  conditions  for  an  RR  and  RL  rotation} 

do  symmetric  to  the  above  L  case  ; 
gp  =  gp.parent  ; 
end  ; 
end  ; 

Figure  3.13.  Simple  restructuring  procedure  without  a  l3  value 
are  in  WB(a)  for  a  shghtly  higher  than  1  -  ^  «  0.2929  which  is  the  largest  value 
of  a  for  which  WB(a)  trees  can  be  maintained. 

3  fi    RBSTs  without  Deletion 
In  some  applications  of  a  dictionary,  we  need  to  support  only  the  insert  and 
search  operations.  In  these  apphcations,  we  can  construct  binary  search  trees  with 
total  cost 

C7(r)<nlog^(v^(n  +  l)) 
by  using  the  simpler  restructuring  algorithm  of  Figure  3.13. 

Theorem  12  When  the  only  operations  are  search  and  insert  and  restructuring  is  done 
as  in  Figure  3.13,  C{T)  <  n\og^{y/b{n  +  1)). 
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Proof  Suppose  T  currently  has  m  -  1  elements  and  a  new  element  is  inserted.  Let 
u  be  the  level  at  which  the  new  element  is  inserted.  Suppose  that  the  restructuring 
pass  performs  rotations  at  9  <  u  of  the  nodes  on  the  path  from  the  root  to  the  newly 
inserted  node.  Then  C{T)  increases  by  at  most  v  =  u-qas&  result  of  the  insertion. 
The  number  of  nodes  on  the  path  from  the  root  to  the  newly  inserted  node  at  which 
no  rotation  is  performed  is  also  v.  Let  these  nodes  be  numbered  1  through  t;  bottom 
to  top.  Let  Si  denote  the  number  of  elements  in  the  subtree  with  root  i  prior  to  the 
restructuring  pass.  We  see  that  Si  >  1  and  S2  >  2.  For  node  i,  2  <  i  <  u,  one  of  its 
subtrees  contains  node  i  -  1.  Without  loss  of  generality,  let  this  be  the  left  subtree 
of  i.  Let  the  root  of  the  right  subtree  of  t  be  d.  So, 

5.  >  Si-i  +  s{d)  +  1. 

If  I  -  1  is  not  the  left  child  of  i,  then  since  no  rotation  is  done  at  t,  s{d)  >  If 
z  -  1  is  the  left  child  of  i,  then  consider  node  i  -  2.  This  is  in  one  of  the  subtrees  of 
i.  Since  no  rotation  is  performed  at  t  -  1,  s{d)  >  5',_2-  Since        >  5,_2,  we  get 

Si  >  Si-i  +  Si-2  +  1. 

Hence,  S„  >  N^,  where  is  the  minimum  number  of  elements  in  a  COST  of 
height  V.  So,  v  <  log^(\/5(m  +  1)).  So,  when  an  element  is  inserted  into  a  tree  that 
has  m  -  1  elements,  its  cost  C{T)  increases  by  at  most  log^(\/5(m  +  1)).  Starting 
with  an  empty  tree  and  inserting  n  elements  results  in  a  tree  whose  cost  is  at  most 
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nlog^(v/5(n  +  l)). 

Corollary  2  The  expected  cost  of  a  search  or  insert  in  a  BEST  constructed  as  above 
is  O(logn). 

Proof  Since  C{T)  <  nlog^(v/5(n  +  1)),  the  expected  search  cost  is  C{T)/n  < 
log^(>/5(n  +  1)).  The  cost  of  an  insert  is  the  same  order  as  that  of  a  search  as  each 
insert  follows  the  corresponding  search  path  twice  (top  down  and  bottom  up).  □ 

3.7    Experimental  Results 

For  comparison  purposes,  we  wrote  C  programs  for  BBSTs,  SBBSTs  (simple 
BBSTs),  BBSTDs  (BBSTs  in  which  procedure  Restructuring4  (Figure  3.13)  is  used 
to  restructure  following  inserts  as  well  as  deletes),  unbalanced  binary  search  trees 
(BST),  AVL-trees,  top-down  red-black  trees  (RB-T),  bottom-up  red-black  trees  (RB- 
B)  [31],  weight  balanced  trees  (WB),  deterministic  skip  lists  (DSL),  treaps  (TRP), 
and  skip  lists  (SKIP).  For  the  BBST  and  SBBST  structures,  we  used  ^  =  207/500 
while  for  the  WB  structure,  we  used  a  =  207/707.  While  these  are  not  the  highest 
permissible  values  of  /3  and  o,  this  choice  permitted  us  to  use  integer  arithmetic 
rather  than  the  substantially  more  expensive  real  arithmetic.  For  instance,  0-{a,b) 
for  ^  =  207/500  can  be  checked  using  the  comparisons  207(5(a)  -  1)  >  5005(6) 
and  207(5(6)  -  1)  >  5005(a).  The  randomized  structures  TRP  and  SKIP  used  the 
same  random  number  generator  with  the  same  seed.  SKIP  was  programmed  with 
probability  value  p  =  1/4  as  in  Pugh  [26]. 
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To  minimize  the  impact  of  system  call  overheads  on  run  time  measurements, 
we  programmed  all  structures  using  simulated  pointers  (i.e.,  an  array  of  nodes  with 
integer  pointers  [27].  Skip  lists  use  variable  size  nodes.  This  requires  more  complex 
storage  management  than  required  by  the  remaining  structures  which  use  nodes  of 
the  same  size.  For  our  experiments,  we  implemented  skip  lists  using  fixed  size  nodes, 
each  node  being  of  the  maximum  size.  As  a  result,  our  run  times  for  skip  lists  are 
smaller  than  if  a  space  efficient  implementation  had  been  used.  In  all  our  tree  struc- 
ture implementations,  null  pointers  were  replaced  by  a  pointer  to  a  tail  node  whose 
data  field  could  be  set  to  the  search/insert/delete  key  and  thus  avoid  checking  for 
falling  off  the  tree.  Similar  tail  pointers  are  part  of  the  defined  structure  of  skip  and 
deterministic  skip  fists.  Each  tree  also  had  a  head  node.  WB(a)  trees  were  imple- 
mented with  a  bottom-up  restructuring  pass.  Our  codes  for  SKIP  and  DSL  are  based 
on  the  codes  of  Pugh  [26]  and  Papadakis  [22],  respectively.  Our  AVL  and  RB-T  codes 
are  based  on  those  of  Papadakis  [22]  and  Sedgewick  [28].  The  treap  structure  was 
implemented  using  joins  and  splits  rather  than  rotations.  This  results  in  better  per- 
formance. Furthermore,  AVL,  RB-B,  WB,  and  BBST  were  implemented  with  parent 
pointers  in  addition  to  left  and  right  child  pointers.  For  BBSTs,  the  enhancements 
described  in  Section  3.4.4  for  insert  and  delete  (see  Figure  3.10)  were  employed.  No 
rotations  were  performed  during  a  search  when  using  any  of  the  structures. 

For  our  experiments,  we  tried  two  versions  of  the  code.  These  varied  in  the 
order  in  which  the  'equality'  and  'less  than'  or  'greater  than'  check  between  x  and  e 
(where  x  is  the  key  being  searched/inserted/deleted  and  e  is  the  key  in  the  current 
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node)  is  done.  In  version  1,  we  conducted  an  initial  experiment  to  determine  if  the 
total  comparison  count  is  less  using  the  order  L: 
if  X  <  e  then  move  to  left  child 
else  if  I  /  e  then  move  to  right  child 
else  found 
or  the  order  R: 

if  a:  >  e  then  move  to  right  child 
else  if  I  7^  e  then  move  to  left  child 
else  found. 

Our  experiment  indicated  that  doing  the  'left  child'  check  first  (i.e.  order  L)  worked 
better  for  AVL,  BEST,  BBSTD,  and  DSL  structures  while  R  worked  better  for  the 
RB-T,  RB-B,  WB,  SBBST,  and  TRP  structures.  No  significant  difference  between  L 
and  R  was  observed  for  BSTs.  For  skip  lists,  we  do  not  have  the  flexibility  to  change 
the  comparison  order.  The  version  1  codes  performed  the  comparisons  in  the  order 
determined  to  be  better.  For  BSTs,  the  order  R  was  used. 

In  the  version  2  codes  the  comparisons  in  eax:h  node  took  the  standard  form 
if  X  =  e  then  found 
else  if  X  <  e  then  move  to  left  child 
else  move  to  right  child. 
The  version  2  restructuring  code  for  BBSTs  differed  from  that  of  Figure  3.10  in 
that  the  '>'  test  in  the  second,  third,  and  forth  if  statements  was  changed  to  '>'. 
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No  change  was  made  in  the  corresponding  if  statements  for  RR  and  RL  rotations. 
While  this  increased  the  number  of  comparisons,  it  reduced  the  run  time. 

We  experimented  with  n  =  10,000,  50,000,  100,000,  and  200,000.  For  each  n, 
the  following  experiments  were  conducted: 

(a)  start  with  an  empty  structure  and  perform  n  inserts; 

(b)  search  for  each  item  in  the  resulting  structure  once;  items  are  searched  for  in  the 
order  they  were  inserted 

(c)  perform  an  alternating  sequence  of  n  inserts  and  n  deletes;  in  this,  the  n  elements 
inserted  in  (a)  are  deleted  in  the  order  they  were  inserted  and  n  new  elements  are 
inserted 

(d)  search  for  each  of  the  remaining  n  elements  in  the  order  they  were  inserted 

(e)  delete  the  n  elements  in  the  order  they  were  inserted. 

For  each  n,  the  above  five  part  experiment  was  repeated  ten  times  using  different 
random  permutations  of  distinct  elements.  For  each  permutation,  we  measured  the 
total  number  of  element  comparisons  performed  and  then  averaged  these  over  the  ten 
permutations. 

First,  we  report  on  the  relative  performance  of  SBBSTs,  BBSTDs,  and  BB- 
STs.  For  this  comparison,  we  used  only  version  1  of  the  code.  Table  3.1  gives  the 
average  number  of  key  comparisons  performed  for  each  of  the  five  parts  of  the  ex- 
periment. The  three  versions  of  our  proposed  data  structure  are  very  competitive 
on  this  measure.  BBSTDs  and  BBSTs  generally  performed  fewer  comparisons  than 
did  SBBSTs.  All  three  structures  had  a  comparison  count  within  2%  of  one  another. 
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Table  3.1.  The  number  of  key  comparisons  on  random  inputs  (version  1  code) 


n 

operation 

SBBST 

BBSTD 

BBST 

insert 

212223 

212111 

cf^iA  IT"  n 

194661 

191599 

191578 

1U,UUU 

ins/  aei 

416967 

416862 

194957 

191666 

191676 

166441 

166487 

insert 

1936682 

1236114 

sedTcu 

1 152137 

1135131 

1134969 

OU,UUU 

ins/  aei 

94*^791 8 

2438083 

2437639 

1153821 

1134277 

1134062 

tieiete 

1018675 

1007766 

1007688 

insert 

2624829 

2623792 

searcn 

9458079 

2423988 

2423613 

1UU,UUU 

ins/del 

5180383 

5179653 

search 

2461221 

2420282 

2419990 

delete 

2190798 

2168049 

2168110 

insert 

5580139 

5555190 

5553256 

search 

5223989 

5148220 

5147698 

200,000 

ins/del 

10981441 

10969578 

10968053 

search 

5229172 

5144808 

5144148 

delete 

4692447 

4641349 

4641389 

However,  when  we  used  ordered  data  rather  than  random  data  (Table  3.2),  SBBSTs 
performed  noticeably  inferior  to  BBSTDs  and  BBSTs;  the  later  two  remained  very 
competitive. 

Tables  3.3  and  3.4  give  the  average  heights  of  the  trees  using  random  data  and 
using  ordered  data,  respectively.  The  first  number  gives  the  height  following  part  (a) 
of  the  experiment  and  the  second  following  part  (c).  The  numbers  are  identical  for 
BBSTDs  and  BBSTs  and  slightly  higher  (lower)  for  SBBSTs  using  random  (ordered) 
data. 


Table  3.2.  The  number  of  key  comparisons  on  ordered  inputs  (version  1 


n 

operation 

SBBST 

BBSTD 

BBST 

1  n  COT*T 

1  7ni  89 

1 50554 

150554 

oCa.1  Cli 

1 88722 

185530 

185530 

1  n  nnn 

ins/  ciei 

315177 

%J  XxJ  X  t  1 

314998 

191681 

184155 

184155 

215214 

135311 

135131 

QQl ^9fi 

879967 

872967 

1 1 1 7174 

1101481 

1101481 

ou,uuu 

ins/  Gci 

1 806346 

1805439 

1116390 

X  X  J.       V-r  V 

1098065 

1098065 

19777'Sfi 

792717 

791815 

insert 

91 0*^808 

1850^48 

1850548 

9*^84327 

2354757 

2354757 

1  HA  nnn 

lUUjUUU 

ins/  aei 

^94Q1Q4 

3893415 

3821594 

search 

2382759 

2346118 

2346128 

delete 

2738294 

1686397 

1684584 

insert 

4449143 

3903083 

3903083 

search 

5068632 

4946753 

4946753 

200,000 

ins/del 

11105525 

8051695 

8048058 

search 

5065496 

5001967 

5001967 

delete 

5842168 

3580856 

3577223 

Table  3.3.  Height  of  the  trees  on  random  inputs  (version  1  code) 


n 

SBBST 

BBSTD 

BBST 

10,000 

17,17 

16,16 

16,16 

50,000 

20,20 

19,19 

19,19 

100,000 

21,21 

20,20 

20,20 

200,000 

22,23 

21,21 

21,21 

Table  3.4.  Height  of  the  trees  on  ordered  inputs  (version  1  code) 


n 

SBBST 

BBSTD 

BBST 

10,000 

16,15 

17,17 

17,17 

50,000 

20,20 

20,20 

20,20 

100,000 

21,21 

21,21 

21,21 

200,000 

22,22 

23,22 

23,22 
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The  average  number  of  rotations  performed  by  each  of  the  three  structures  is 
given  in  Tables  3.5  and  3.6.  A  single  rotation  (i.e.,  LL  or  RR)  is  denoted  'S'  and  a 
double  rotation  (i.e.,  LR  or  RL)  denoted  'D'.  In  the  case  of  BBSTs,  double  rotations 
have  been  divided  into  three  categories:  D  =  LR  and  RL  rotations  that  do  not  perform 
a  second  substep  rotation;  DS  =  LR  and  RL  rotations  with  a  second  substep  rotation 
of  type  LL  and  RR;  DD  =  LR  and  RL  rotations  with  a  second  substep  rotation  of 
type  LR  and  RL.  BBSTDs  and  BBSTs  performed  a  comparable  number  of  rotations 
on  both  data  sets.  However,  on  random  data  SBBSTs  performed  about  haK  as  many 
rotations  as  did  BBSTDs  and  BBSTs.  On  ordered  data,  SBBSTs  performed  15  to 
20%  fewer  rotations  on  part  (a),  34%  fewer  on  part  (c),  and  51%  fewer  on  part  (e). 

The  run-time  performance  of  the  structures  is  significantly  influenced  by  com- 
piler and  architectural  features  as  well  as  the  complexity  of  a  key  comparison.  The 
results  we  report  are  from  a  SUN  SPARC-5  using  the  UNIX  C  compiler  cc  with 
optimization  option.  Because  of  instruction  pipelining  features,  cache  replacement 
policies,  etc.,  the  measured  run  times  are  not  always  consistent  with  the  compiler 
and  architecture  independent  metrics  reported  in  Tables  3.1  through  3.6  and  later 
in  Tables  3.11  through  3.16.  For  example,  since  the  search  codes  for  all  tree  based 
methods  are  essentially  identical,  we  would  expect  methods  with  a  smaller  compar- 
ison count  to  have  a  smaller  run  time  for  parts  (b)  and  (d)  of  the  experiment.  This 
was  not  always  the  case. 

Tables  3.7  and  3.8  give  the  run  times  of  the  three  BBST  structures  using  integer 
keys  and  Tables  3.9  and  3.10  do  this  for  the  case  of  real  (i.e.,  floating  point)  keys.  The 


BEST 

DD 

CO  g  oo 

05  2  CM 

lO  T— 1  r-H 

lO  CO  CO 

tJ<  lO  >— < 

920 

1046 

260 

1840 
2059 
523 

DS 

r-H    CM  . 

iC  00  ^ 
r-(   CM  ^ 

754 
1161 

94.8 

1527 
2275 

3054 
4593 
QQO 

Q 

3938 
5849 
2018 

19732 
29198 
10033 

39461 
58491 
20147 

79013 

116940 

40157 

CD 

5025 

10104 

5201 

25059 
50979 
26068 

50047 

101836 

51943 

100205 
203568 
103826 

BBSTD 

Q 

4314 
6311 
2104 

21596 
31499 
10462 

43230 
62967 
21022 

86605 

125960 

41884 

CO 

5045 

10158 

5235 

25216 
51238 
26214 

50283 

102218 

52227 

100664 
204459 
104344 

SBBST 

Q 

2220 
3216 
1110 

11120 
16125 
5648 

22262 
32203 
11306 

44525 
64417 
22551 

CO 

2341 
4269 
1607 

11719 
21330 
8058 

23450 
42780 
16095 

46934 
85283 
32233 

operation 

insert 
ins/del 
delete 

insert 
ins/del 
delete 

insert 
ins/del 
delete 

insert 
ins/del 
delete 

c 

10,000 

50,000 

100,000 

200,000 

80 


Table  3.6.  The  number  of  rotations  on  ordered  inputs  (version  1  code) 


SBBST 

BBSTD 

BBS! 

n 

operation 

S 

D 

S 

D 

S 

D 

DD 

10,000 

insert 
ins/del 
delete 

QQS4 

14997 
4989 

0 
0 
0 

9985 

16567 

6570 

2387 
6130 
3726 

9985 

16644 

6647 

2387 
5797 
3392 

0 

25 
2d 

0 

154 

l04 

50,000 

insert 
ins/del 
delete 

49980 
74996 
24987 

0 
0 
0 

49983 
82862 
32859 

11956 
30659 
18686 

49983 
83247 
33242 

11956 
17018 

0 

137 
136 

0 

770 
766 

100,000 

insert 
ins/del 
delete 

99979 

149996 

49986 

0 
0 
0 

99983 

165738 

65733 

23917 
61327 
37392 

99983 

166504 

66505 

23917 
57969 
34040 

0 

280 
278 

0 

1540 
1536 

200,000 

insert 
ins/del 
delete 

199978 
299996 
99985 

0 
0 
0 

199982 
331473 
131478 

47839 

122653 

74795 

199982 
333012 
133016 

47839 

115938 

68086 

0 

559 
557 

0 

3078 
3076 

sum 


of  the  run  time  for  parts  (a)  -  (e)  of  the  experiment  is  graphed  in  Figure  3.14. 
For  random  data,  SBBSTs  significantly  and  consistently  outperformed  BBSTDs  and 
BBSTs.  On  ordered  data,  however,  BBSTDs  were  slightly  faster  than  BBSTs  and 
both  were  significantly  faster  than  SBBSTs. 

Since  BBSTs  generated  trees  with  the  least  search  cost,  we  expect  BBSTs  to 
outperform  SBBSTs  and  BBSTDs  in  applications  where  the  comparison  cost  is  very 
high  relative  to  that  of  other  operations  and  searches  are  done  with  a  much  higher 
frequency  than  inserts  and  deletes.  However,  with  the  mix  of  operations  used  in 
tests,  SBBSTs  are  the  clear  choice  for  random  inputs  and  BBSTDs  for  ordered 


our 


inputs. 

In  comparing  with  the  other  structures,  our  tables  repeat  the  data  for  BBSTs. 
The  reader  may  make  the  comparison  with  SBBSTs  and  BBSTDs. 
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Table  3.7.  Run  time  on  random  inputs  using  integer  keys  (version  1  code) 


n 

operation 

SBBST 

BBSTD 

BBST 

lusen 

0.30 

0.34 

searcii 

0  06 

0.06 

0.07 

iU,UUU 

llio  /  yiKZL 

0.57 

0.62 

0.70 

searcij 

0  06 

0.06 

0.06 

delete 

0.22 

0.25 

0.26 

insert 

1.48 

1.61 

1.75 

search 

0.35 

0.36 

0.37 

50,000 

ins/del 

2.90 

3.47 

3.84 

search 

0.36 

0.38 

0.39 

delete 

1.13 

1.47 

1.62 

insert 

3.00 

3.57 

3.80 

search 

0.78 

0.83 

0.84 

100,000 

ins/del 

6.28 

7.78 

8.41 

search 

0.83 

0.87 

0.88 

delete 

2.54 

3.31 

3.58 

insert 

6.56 

7.74 

8.37 

search 

1.80 

1.89 

1.89 

200,000 

ins/del 

13.89 

17.32 

18.57 

search 

1.86 

1.98 

1.98 

delete 

5.64 

7.41 

8.02 

Time  Unit  :  sec 
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Table  3.8.  Run  time  on  ordered  inputs  using  integer  keys  (version  1  code) 


n 

operation 

SBBST 

BBSTD 

BBST 

0.32 

0.20 

0.27 

oCcLl  V-l-l 

0.05 

0.03 

0.05 

10,000 

ins  /  del 

0.58 

0.43 

0.57 

0.07 

0.03 

0.03 

delete 

0.20 

0.17 

0.23 

insert 

1.38 

1.20 

1.10 

search 

0.25 

0.20 

0.20 

50,000 

ins/del 

2.63 

2.18 

2.40 

search 

0.25 

0.20 

0.20 

delete 

0.95 

0.92 

1.05 

insert 

3.43 

2.23 

2.53 

search 

0.72 

0.45 

0.42 

100,000 

ins/del 

5.97 

4.70 

5.13 

search 

0.55 

0.47 

0.42 

delete 

2.10 

1.98 

2.15 

insert 

6.65 

4.95 

5.25 

search 

1.20 

0.92 

0.90 

200,000 

ins /del 

13.13 

10.23 

10.88 

search 

1.17 

0.90 

0.90 

delete 

4.63 

4.25 

4.58 

Time  Unit  :  sec 


Table  3.9.  Run  time  on  random  real  inputs  (version  1  code) 


n 

operation 

SBBST 

BBSTD 

BBST 

insert 

0.23 

0.34 

0.36 

search 

0.07 

0.10 

0.10 

10,000 

ins/del 

0.44 

0.75 

0.79 

search 

0.08 

0.10 

0.10 

delete 

0.17 

0.29 

0.30 

insert 

1.43 

1.76 

1.93 

search 

0.47 

0.53 

0.52 

50,000 

ins/del 

2.76 

3.89 

4.22 

search 

0.50 

0.54 

0.55 

delete 

1.13 

1.62 

1.76 

insert 

2.96 

3.94 

4.36 

search 

1.08 

1.17 

1.16 

100,000 

ins/del 

6.11 

8.58 

9.30 

search 

1.12 

1.20 

1.22 

delete 

2.50 

3.66 

3.95 

insert 

6.85 

8.92 

9.33 

search 

2.41 

2.58 

2.57 

200,000 

ins/del 

13.86 

19.49 

20.46 

search 

2.49 

2.69 

2.66 

delete 

5.61 

8.25 

8.80 

Time  Unit  :  sec 


Table  3.10.  Run  time  on  ordered  real  inputs  (version  1  code) 


n 

operation 

SBBST 

BBSTD 

BBST 

insert 

0.27 

0.23 

0.20 

search 

0.08 

0.07 

0.07 

10,000 

ins/del 

0.53 

0.50 

0.43 

search 

0.08 

0.07 

0.05 

delete 

0.18 

0.23 

0.20 

insert 

1.43 

1.25 

1.12 

search 

0.40 

0.30 

0.30 

50,000 

ins/del 

2.80 

2.17 

2.37 

search 

0.40 

0.30 

0.30 

delete 

1.07 

0.90 

0.97 

insert 

3.28 

2.58 

2.77 

search 

0.90 

0.62 

0.63 

100,000 

ins/del 

6.15 

4.70 

5.13 

search 

0.87 

0.62 

0.63 

delete 

2.35 

1.93 

2.10 

insert 

7.37 

4.55 

4.92 

search 

1.85 

1.32 

1.32 

200,000 

ins/del 

13.35 

10.03 

10.93 

search 

1.87 

1.33 

1.33 

delete 

5.08 

4.17 

4.43 

Time  Unit  :  sec 
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Time  is  sum  of  time  for  parts  (a)-(e)  of  the  experiment 


n 


Figure  3.14.  Run  time  on  real  inputs  (version  1  code) 

The  average  number  of  comparisons  for  each  of  the  five  parts  of  the  experiment 
are  given  in  Table  3.11  for  the  version  1  implementation.  On  the  comparison  measure, 
AVL,  RB-B,  WB,  and  BBSTs  are  the  front  runners  and  are  quite  competitive  with 
one  another.  On  parts  (a)  (insert  n  elements)  and  (c)  (insert  n  and  delete  n  elements), 
AVL  trees  performed  best  while  on  the  two  search  tests  ((b)  and  (d))  and  the  deletion 
test  (e),  BBSTs  performed  best. 

Table  3.12  gives  the  number  of  comparisons  performed  when  ordered  data  (i.e., 
the  elements  in  part  (a)  are  1,2, ...  ,n  and  are  inserted  in  this  order)  and  those  in 
part  (c)  are  n  +  1, . . .  ,2n  (in  this  order)  is  used  instead  of  random  permutations  of 
distinct  elements.  This  experiment  attempts  to  model  realistic  situations  in  which 
the  inserted  elements  are  in  "nearly  sorted  order".  BSTs  were  not  included  in  this 
test  as  they  perform  very  poorly  with  ordered  data  taking  O(n^)  time  to  insert  n 
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times.  The  computer  time  needed  to  perform  this  test  on  BSTs  was  determined 
to  be  excessive.  This  test  exhibited  greater  variance  in  performance.  Among  the 
deterministic  structures,  BBSTs  outperformed  the  others  in  parts  (a)  -  (d)  while 
AVL  trees  were  ahead  in  part  (e).  For  part  (a),  BBSTs  performed  approximately 
45%  fewer  comparisons  than  did  AVL  trees  and  approximately  12%  fewer  than  WB 
trees.  The  randomized  structure  TRP  was  the  best  of  the  eight  structures  reported 
in  Table  3.12  for  part  (a).  It  performed  approximately  10%  fewer  comparisons  than 
did  BBST  trees.  However,  the  BBST  remained  best  overall  on  parts  (b),  (c),  and 

The  heights  of  the  trees  (number  of  levels  in  the  case  of  DSL  and  SKIP)  for 
the  experiments  with  random  and  ordered  data  are  given  in  Tables  3.13  and  3.14 
respectively.  The  first  number  in  each  table  entry  is  the  tree  height  after  part  (a)  of 
the  experiment  and  the  second,  the  height  after  part  (c).  In  all  cases,  the  number  of 
levels  using  skip  lists  is  fewest.  However,  among  the  tree  structures,  AVL  and  BBST 
trees  have  least  height  on  random  data  and  AVL  has  least  with  ordered  data. 

Tables  3.15  and  3.16,  respectively,  give  the  number  of  rotations  performed  by 
each  of  the  deterministic  tree  schemes  for  experiment  parts  (a),  (c),  and  (e).  Note 
that  none  of  the  schemes  performs  rotations  during  a  search. 

On  ordered  data,  BBSTs  perform  about  25%  more  rotations  than  do  the  re- 
maining structures.  These  remaining  structures  perform  about  the  same  number  of 
rotations.  On  random  data,  AVL  trees,  bottom-up  red-black  trees  and  WB  trees  per- 
form a  comparable  number  of  rotations.  Top-down  red-black  trees  and  BBST  trees 


SKIP 

224757 
255072 
519430 
256124 
231745 

1357076 
1537547 
2996512 
1501731 
1373858 

2919371 
3188621 
6399463 
3225343 
2981173 

6178596 

6697223 

13377747 

6680642 

6149268 

TRP 

296866 
258662 
601137 
254119 
242743 

1717037 
1503452 
3456045 
1497081 
1451835 

3632046 
3247143 
7476441 
3310823 
3177135 

7682439 

6797942 

15543559 

6916150 

6700557 

DSL 

276247 
258089 
923524 
256578 
526242 

1640660 
1512093 
5351715 
1499657 
3077266 

3513401 

3244497 

11545200 

3229747 

6561272 

7483199 
6887196 
24207106 
6814733 
13811271 

BBST 

212111 
191578 
416862 
191676 
166487 

1236114 
1134969 
2437639 
1134062 
1007688 

2623792 
2423613 
5179653 
2419990 
2168110 

5553256 

5147698 

10968053 

5144148 

4641389 

WB 

211916 
194153 
414635 
194442 
167531 

1238628 
1149970 
2431281 
1151578 
1015988 

2631411 
2453855 
5170695 
2456748 
2185213 

5571133 

5215568 

10956496 

5220965 

4680768 

RB-B 

211886 
194291 
414990 
195525 
167455 

1236968 
1146754 
2424944 
1152764 
1013144 

2626314 
2446466 
5154118 
2457531 
2177946 

5558174 

5199786 

10921880 

5223154 

4664344 

RB-T 

262838 
194606 
515184 
197399 
200218 

1550701 
1150466 
3058045 
1173662 
1242426 

3305332 
2451137 
6564352 
2502098 
2692672 

7016676 

5209189 

13940982 

5332771 

5800203 

AVL 

211401 
193253 
411220 
193141 
167312 

1234911 
1147273 
2417733 
1145808 
1013535 

2623894 
2445659 
5137280 
2443038 
2181327 

5553640 

5191730 

10862426 

5186737 

4664876 

BST 

264175 
254175 
516853 
252200 
215555 

1560958 
1510958 
3061868 
1500504 
1316917 

3329780 
3229780 
6537563 
3208453 
2839934 

7076132 

6876132 

13907058 

6830718 

6095324 

operation 

insert 

search 

ins/del 

search 

delete 

insert 

search 

ins/del 

search 

delete 

insert 

search 

ins/del 

search 

delete 

insert 

search 

ins/del 

search 

delete 

c 

10,000 

50,000 

100,000 

200,000 
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C 
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'C 
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a 
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V 
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V 

H 

CO 
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SKIP 

247129 
256706 
354566 
250538 
84392 

1422120 
1467217 
1973416 
1449810 
486498 

2925618 
2970715 
4406427 
3277089 
961283 

6403207 
6448304 
9062233 
6458321 
1995215 

TRP 

135989 
271087 
390899 
269031 
193080 

825390 

1540082 

2194668 

1568903 

1181612 

1724473 
3564282 
4438266 
3281308 
2403622 

3428355 
7174727 
9054078 
7006341 
5094044 

DSL 

435199 
262423 
983676 
249694 
468244 

2585557 
1509152 
6019215 
1481819 
2785792 

5521408 

3218246 

12788447 

3163554 

5971196 

11743159 
6836428 
27076911 
6727017 
12741948 

BBST 

150554 
185530 
314998 
184155 
135131 

872967 

1101481 

1805439 

1098065 

791815 

1850548 
2354757 
3821594 
2346128 
1684584 

3903083 
4946753 
8048058 
5001967 
3577223 

WB 

171017 
188722 
425843 
191681 
214930 

995720 

1117174 

2475487 

1116390 

1276262 

2112201 
2384327 
5254541 
2382759 
2735270 

4465935 

5068632 

11116226 

5065496 

5836096 

CD 
CQ 

241383 
190106 
508810 
190090 
218216 

1436225 
1120495 
3055100 
1126126 
1323918 

3072389 
2390961 
6510188 
2402224 
2847792 

6544713 

5081891 

13820364 

5104418 

6095538 

RB-T 

376228 
188246 
718040 
189494 
276136 

2233658 
1117001 
4311748 
1168633 
1719212 

4767564 
2383979 
9223606 
2487243 
3737982 

10135418 

5067933 

19647336 

5274461 

8075474 

AVL 

277234 
191917 
421032 
195133 
104038 

1618930 
1120497 
2418422 
1124001 
607478 

3437858 
2390963 
5111850 
2397971 
1289954 

7275714 

5081893 

10773706 

5095909 

2729906 

operation 

insert 

search 

ins/del 

search 

delete 

insert 

search 

ins/del 

search 

delete 

insert 

search 

ins/del 

search 

delete 

insert 

search 

ins/del 

search 

delete 

c 

10,000 

50,000 

100,000 

200,000 
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Table  3.13.  Height  of  the  trees  on  random  inputs  (version  1  code) 


n 

BST 

AVL 

RB-T 

RB-B 

WB 

BBST 

DSL 

TRP 

SKIP 

10,000 
50,000 
100,000 
200,000 

31,31 
38,38 
41,41 
44,43 

16,16 
19,19 
20,20 
21,21 

17,18 
20,21 
21,22 
22,24 

16,17 
19,20 
20,21 
21,22 

17,17 
20,20 
21,22 
23,23 

16,16 
19,19 
20,20 
21,21 

12,11 
13,12 
14,13 
15,14 

32,31 
38,37 
41,40 
43,44 

8,8 
9,9 
9,9 
9,9 

Table  3.14.  Height  of  the  trees  on  ordered  inputs  (version  1  code) 


n 

AVL 

RB-T 

RB-B 

WB 

BBST 

DSL 

TRP 

SKIP 

10,000 
50,000 
100,000 
200,000 

14,14 
16,16 
17,17 
18,18 

20,20 
23,23 
25,25 
27,27 

24,24 
29,28 
31,30 
33,32 

16,15 
20,20 
21,21 
22,22 

17,17 
20,20 
21,21 
23,22 

14,13 
16,16 
17,17 
18,18 

33,34 
41,41 
46,41 
47,46 

8,8 
9,9 
9,9 
9,9 

perform  a  significantly  larger  number  of  rotations.  In  fact,  BBSTs  perform  about 
twice  as  many  rotations  as  AVL  trees. 

The  average  run  times  for  the  random  data  tests  are  given  in  Table  3.17  and 
in  Table  3.18  for  the  ordered  data  test.  Both  of  these  use  integer  keys.  The  times 
using  real  keys  are  given  in  Tables  3.19  and  3.20.  The  sum  of  the  run  time  for  parts 
(b)  and  (d)  of  the  experiment  is  graphed  in  Figure  3.15  for  random  data  and  in 
Figure  3.16  for  ordered  data.  The  graph  of  Figure  3.17  shows  only  one  line  MIX 
for  AVL,  RB-T,  RB-B,  WB,  and  BBST  while  that  of  Figure  3.18  shows  MIX  for 
AVL,  RB-T,  RB-B,  and  WB  as  the  times  for  these  are  very  close.  With  integer 
keys  and  random  data,  unbalanced  binary  search  trees  (BSTs)  outperformed  each 
of  the  remaining  structures.  The  next  best  performance  was  exhibited  by  bottom- 
up  red-black  trees.   They  did  marginally  better  than  AVL  trees.   The  remaining 
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103826 

WB 

Q 
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41480 
59911 
20800 

CO 
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21266 
7963 

22723 
42567 
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45458 
84927 
31984 

RB-B 

Q 

1933 
2591 
1166 

9689 

12979 

5844 

19414 
25919 
11708 

38793 
52030 
23379 

1946 
4053 
1845 

9710 
20255 
9196 

19340 
40618 
18530 

38797 
80892 
37083 

RB-T 

Q 

1955 
8213 
2678 

9815 

45180 

13431 

19677 

103835 

26953 

39291 
209941 
54046 

CD 

1964 

14773 

9558 

9822 

81895 
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247905 

AVL 

Q 
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Time  is  sum  of  time  for  parts  (b)  and  (d)  of  the  experiment 
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Figure  3.15.  Run  time  on  random  real  inputs  (version  1  code) 


structures  have  a  noticeably  inferior  structure.  For  ordered  integer  keys,  BSTs  take 
more  time  than  we  were  willing  to  expend.  Of  the  remaining  structures,  treaps 
generally  performed  best  on  parts  (a),  (c),  and  (e)  while  BBSTs  did  best  on  parts 
(b)  and  (d). 

With  real  keys  and  random  data,  BSTs  did  not  outperform  the  remaining  struc- 
tures. Now,  the  five  balanced  binary  tree  structure  became  quite  competitive  with 
respect  to  the  search  operations  (i.e.,  parts  (b)  and  (d)).  RB-B  generally  outper- 
formed the  other  structures  on  parts  (a),  (c),  and  (e).  Using  ordered  real  keys,  the 
treap  was  the  clear  winner  on  parts  (a),  (c),  and  (e)  while  BBSTs  handily  outper- 
formed the  remaining  structures  on  parts  (b)  and  (d). 

Some  of  the  experimental  results  using  version  2  of  the  code  are  shown  in  Ta- 
bles 3.21-  3.24.  On  the  comparison  measure,  with  random  data  (Table  3.21),  skip 


SKIPJ 

0.24 
0.18 
0.45 
0.18 
0.16 

1.36 
1.25 
2.73 
1.16 
1.10 

2.84 
2.63 
6.13 
2.61 
2.41 

6.25 

5.85 

13.29 

5.81 

5.35 

TRP 

0.18 
0.09 
0  33 
0.09 
0.08 

1.01 
0.56 
1.86 
0.56 
0.51 

2.23 
1.30 
4.10 
1.29 
1.23 

4.70 
2.91 
8  95 
2.93 
2.76 

DSL 

0.19 
0.09 
0.49 
0.09 
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0.58 
2.77 
0.57 
1.16 

2.46 
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5.56 
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BBST 
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0.26 
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3.84 
0.39 
1.62 
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0.84 
8.41 
0.88 
3.58 

8.37 

1.89 

18.57 

1.98 

8.02 

WB 

0.20 
0.05 
0.39 
0.05 
0.16 

1.18 
0.35 
2.22 
0.36 
0.94 

2.54 
0.78 
4.86 
0.83 
2.11 

5.21 
1.78 
10.54 
1.87 
4.55 

RB-B 

0.12 
0.06 
0.22 
0.05 
0.09 

0.73 
0.36 
1.26 
0.36 
0.51 

1.54 
0.83 
2.74 
0.84 
1.14 

3.18 
1.81 
5.99 
1.91 
2.51 

RB-T 

0.15 
0.05 
0.36 
0.05 
0.12 

0.98 
0.36 
2.50 
0.44 
1.01 

2.10 
0.80 
5.52 
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2.26 

4.41 

1.81 
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2.25 

5.06 
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O.Zl 

0.05 
0.08 

0.79 
0.36 

1.4o 
0.41 
0.54 

1.57 
0.80 

0.94 
1.24 

3.37 
1.80 
6.99 
2.12 
2.69 

BST 

0.08 
0.05 

U.14 

0.05 
0.05 

0.65 
0.40 

0.40 
0.39 

1.34 
0.88 

0.93 
0.88 

2.79 
2.00 
0.24 
2.08 
2.01 

operation 

insert 

search 

ins/del 

search 

delete 

insert 

search 

ins/del 

search 

delete 

insert 

search 

ins/del 

search 

delete 

insert 

search 

ins/del 

search 

delete 

C 

I 

10,000 

50,000 

100,000 

200,000 
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Table  3.18.  Run  time  on  ordered  inputs  using  integer  keys  (version  1  code) 


n 

operation 

AVL 

RB-T 

RB-B 

WB 

BEST 

DSL 

TRP 

bKlr 

insert 

0.12 

0.17 

0.12 

0.18 

0.27 

0.23 

0.08 

0.20 

search 

0.05 

0.03 

0.03 

0.07 

0.05 

0.07 

0.05 

0.12 

10,000 

ins/del 

0.18 

0.32 

0.20 

0.35 

0.57 

0.42 

0.17 

0.20 

search 

0.05 

0.05 

0.05 

0.05 

0.03 

0.07 

0.05 

0.13 

delete 

0.05 

0.10 

0.07 

0.13 

0.23 

0.15 

0.05 

0.07 

insert 

U.  <  0 

1  no 

1.25 

1.10 

0.98 

0.47 

0.92 

search 

0.32 

0.27 

0.27 

0.28 

0.20 

0.33 

0.32 

0.62 

50,000 

ins/del 

1.28 

2.17 

1.25 

2.20 

2.40 

2.03 

0.80 

1.07 

search 

0.28 

0.28 

0.27 

0.28 

0.20 

0.30 

0.37 

0.62 

delete 

0.30 

0.75 

0.37 

0.85 

1.05 

0.65 

0.30 

0.27 

insert 

1.50 

2.52 

1.70 

2.58 

2.53 

2.58 

0.90 

1.72 

search 

0.70 

0.60 

0.57 

0.70 

0.42 

0.70 

0.63 

1.23 

100,000 

ins/del 

2.60 

4.68 

2.53 

4.78 

5.13 

4.42 

1.52 

2.43 

search 

0.63 

0.60 

0.55 

0.62 

0.42 

0.70 

0.58 

1.35 

delete 

0.62 

1.65 

0.78 

1.87 

2.15 

1.42 

0.45 

0.55 

insert 

3.12 

4.82 

3.38 

5.67 

5.25 

4.72 

1.80 

3.52 

search 

1.38 

1.30 

1.22 

1.33 

0.90 

1.60 

1.25 

2.70 

200,000 

ins/del 

5.15 

10.40 

5.35 

10.40 

10.88 

9.48 

3.10 

5.13 

search 

1.33 

1.33 

1.18 

1.32 

0.90 

1.50 

1.28 

2.72 

delete 

1.35 

3.63 

1.68 

4.12 

4.58 

2.98 

0.93 

1.12 

Time  Unit  :  sec 
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SKIP 

0.30 
0.21 
0.53 
0.21 
0.19 

1.67 
1.44 
3.15 
1.33 
1.22 

3.61 
3.00 
7.10 
3.01 
2.75 

7.90 

6.62 

15.83 

6.74 

6.04 

TRP 

0.23 
0.13 
0  41 
0.12 
0.11 

1.19 
0.71 
2.17 
0.71 
0.69 

2.67 
1.66 
4.84 
1.65 
1.61 

5.81 
3.67 
10  48 
3.73 
3.54 

DSL 

0.22 
0.13 
0  62 
0.12 
0.28 

1.48 
0.87 
3.93 
0.86 
1.80 

3.05 
1.84 
8.45 
1.83 
3.91 

6.77 
4.14 
18  88 
4.08 
8.56 

H 

CO 

m 

CQ 

0.36 
0.10 
0.79 
0.10 
0.30 

1.93 
0.52 
4.22 
0.55 
1.76 

4.36 
1.16 
9.30 
1.22 
3.95 

9.33 

2.57 

20.46 

2.66 

8.80 

WB 

0.23 
0.08 
0  38 
0.08 
0.14 

1.29 
0.51 
2.29 
0.56 
0.92 

2.66 
1.14 
4.96 
1.26 
2.03 

5.56 
2.45 
10  65 
2.74 
4.43 

RB-B 

0.17 
0.10 
0.32 
0.10 
0.14 

0.86 
0.51 
1.53 
0.54 
0.72 

1.90 
1.13 
3.33 
1.17 
1.55 

3.88 
2.50 
7.29 
2.62 
3.41 

RB-T 

0.21 
0.09 
0.51 
0.09 
0.17 

1.22 
0.50 
2.74 
0.56 
1.10 

2.34 
1.09 
6.18 
1.27 
2.51 

5.20 

2.42 

13.78 

2.86 

5.55 

AVL 

0.15 
0.07 
0.27 
0.08 
0.09 

0.97 
0.52 

1  77 

0.55 
0.67 

1.85 
1.13 

1.26 
1.50 

3.95 
2.49 
8.25 
2.83 
3.27 

BST 

0.14 

0.09 
n  OA 

0.09 
0.09 

0.94 
0.64 

1  .uo 

0.66 
0.63 

2.06 
1.43 

1.45 
1.39 

4.34 
3.19 

3.21 
3.11 

operation 

insert 

search 

ins/del 

search 

delete 

insert 

search 

ins/del 

search 

delete 

insert  1 

search 

ins/del 

search 

delete 

insert 

search 

ins/del 

search 

delete 

c 

10,000 

50,000 

100,000 

200,000 

a 
6 

•  ^ 
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Table  3.20.  Run  time  on  ordered  real  inputs  (version  1  code) 


n 

operation 

AVL 

RB-T 

RB-B 

WB 

BBST 

DSL 

TRP 

SKIP 

0.13 

0.22 

0.15 

0.25 

0.20 

0.25 

0.12 

0.30 

CVi  \^1X 

0.07 

0.08 

0.07 

0.07 

0.07 

0.10 

0.07 

0.15 

10,000 

ins/del 

0.23 

0.42 

0.27 

0.40 

0.43 

0.47 

0.18 

0.28 

search 

0.07 

0.05 

0.08 

0.08 

0.05 

0.08 

0.08 

0.12 

delete 

0.07 

0.17 

0.08 

0.15 

0.20 

0.20 

0.05 

0.07 

insert 

1.15 

1.58 

1.12 

1.85 

1.12 

1.30 

0.67 

1.35 

search 

0.42 

0.42 

0.43 

0.40 

0.30 

0.53 

0.38 

0.82 

50,000 

ins/del 

1.28 

2.75 

1.57 

2.57 

2.37 

3.02 

0.92 

1.40 

search 

0.40 

0.42 

0.42 

0.48 

0.30 

0.53 

0.40 

0.75 

delete 

0.38 

0.95 

0.55 

0.93 

0.97 

1.15 

0.33 

0.35 

insert 

1.77 

3.23 

2.12 

3.35 

2.77 

3.13 

1.17 

2.42 

search 

0.90 

0.87 

0.90 

0.88 

0.63 

1.12 

0.92 

1.70 

100,000 

ins/del 

3.00 

6.00 

3.42 

5.38 

5.13 

6.32 

1.92 

3.22 

search 

0.97 

0.92 

0.88 

0.98 

0.63 

1.12 

0.82 

1.70 

delete 

0.87 

2.08 

1.17 

2.05 

2.10 

2.40 

0.70 

0.67 

insert 

3.92 

6.42 

4.27 

7.25 

4.92 

6.03 

2.58 

4.93 

search 

1.92 

1.87 

1.92 

1.88 

1.32 

2.40 

1.85 

3.87 

200,000 

ins/del 

5.78 

13.80 

7.33 

11.88 

10.93 

13.72 

3.75 

6.67 

search 

1.90 

1.93 

1.92 

2.13 

1.33 

2.38 

1.75 

3.97 

delete 

1.67 

4.55 

2.48 

4.45 

4.43 

5.10 

1.40 

1.35 

Time  Unit  :  sec 
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Time  is  sum  of  time  for  parts  (b)  and  (d)  of  the  experiment 
8  I  1  1—  1  TK 

7  -       MIX  ^  - 
BEST 


Time 
(sec) 


50000  100000  150000  200000 


n 

Figure  3.16.  Run  time  on  ordered  reed  inputs  (version  1  code) 

lists  performed  best  on  part  (a).  Of  the  deterministic  methods,  BBSTs  slightly  out- 
performed the  others  on  part  (a).  On  parts  (b)  -  (e),  AVL,  RB-T,  RB-B,  WB,  and 
BBSTs  were  quite  competitive  and  outperformed  BSTs  and  the  randomized  schemes. 
BBSTs  performed  best  on  parts  (b)  and  (d),  RB-Ts  did  best  on  part  (e)  and  RB-B 
and  AVL  did  best  on  part  (c).  In  comparing  the  results  of  Table  3.21  to  those  of 
Table  3.11  (using  version  1  code),  we  see  that  the  change  to  version  2  generally  in- 
crecised  the  compaxison  cost  of  the  deterministic  tree  structures  by  about  25%.  For 
the  DSL,  the  change  in  code  had  mixed  results.  Notice  that  for  RB-T  and  DSLs, 
the  comparison  count  for  parts  (a),  (c),  and  (e)  are  the  same  as  for  the  version  1 
code.  This  is  because  for  inserts  and  deletes,  it  is  necessary  to  do  the  equal  check 
first  when  using  these  structures.  For  SKIPs  the  count  is  the  same  for  all  five  parts 
as  the  version  1  «ind  2  codes  are  the  same. 


98 


With  ordered  data  (Table  3.22),  treaps  required  the  fewest  comparisons  for  part 
(a).  Skip  lists  did  best  on  parts  (c)  and  (e),  and  AVL  trees  generally  outperformed 
the  other  structures  on  parts  (b)  and  (d).  Once  again,  the  comparison  counts  were 
generally  higher  using  the  version  2  code  thain  using  the  version  1  code. 

Run  time  data  using  real  keys  is  given  in  Tables  3.23  and  3.24.  The  sum  of  the 
run  time  for  parts  (b)  and  (d)  of  the  experiment  is  graphed  in  Figure  3.17  for  random 
data  and  in  Figure  3.18  for  ordered  data.  The  graph  of  Figure  3.17  shows  only  one 
line  MIX  for  AVL,  RB-T,  RB-B,  WB,  and  BBST  while  that  of  Figure  3.18  shows 
MIX  for  AVL,  RB-T,  RB-B,  and  WB  as  the  times  for  these  are  very  close.  With 
random  data,  RB-B  generally  performed  best  on  part  (a),  on  parts  (b)  and  (d),  the 
front  runner  varied  among  AVL,  RB-T,  and  WB,  and  on  parts  (c)  and  (e)  RB-Bs 
generally  did  best.  On  ordered  data,  TRPs  did  best  on  parts  (a),  (c),  and  (e)  while 
BBSTs  did  best  on  parts  (b)  and  (d). 

3.8  Conclusion 

We  have  developed  a  new  weight  balanced  data  structure  called  ^-BBST.  This 
was  developed  for  the  representation  of  a  dictionary.  In  developing  the  insert/delete 
algorithms,  we  sought  to  minimize  the  search  cost  of  the  resulting  tree.  Our  experi- 
mental results  show  that  BBSTs  generally  have  the  best  search  cost  of  the  structures 
considered.  Furthermore,  this  translates  into  reduced  search  time  when  the  key  com- 
parison cost  is  relatively  high  (e.g.,  for  real  keys).  The  insert  and  delete  algorithms 
for  ^-BBSTs  are  not  as  efficient  as  those  for  other  dictionary  structures  (such  as 
AVL  trees).  As  a  result,  we  recommend  /9-BBSTs  for  environments  where  searches 
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SKIP 

0.31 
0.21 
0.54 
0.21 
0.19 

1.65 
1.42 
3.16 
1.32 
1.21 

3.69 
3.04 
7.18 
2.98 
2.77 

8.01 

6.60 

16.11 

6.70 

6.04 

TRP 

0.25 
0.16 
0  50 
0.14 
0.14 

1.34 
0.83 
2  44 
0.79 
0.75 

2.95 
1.84 
5.39 
1.82 
1.73 

6.81 

4.00 

12.17 

4.02 

3.88 

DSL 

0.23 
0.13 
0  64 
0.13 
0.29 

1.44 
0.89 
3  82 
0.87 
1.80 

3.11 
1.97 
8  50 
1.95 
3.93 

7.05 
4.43 
19  55 
4.37 
8.91 

BBST 

0.36 
0.11 
0.80 
0.11 
0.32 

1.88 
0.55 
3.97 
0.56 
1.63 

4.01 
1.23 
8.73 
1.27 
3.64 

9.20 

2.69 

19.53 

2.77 

8.24 

WB 

0.25 
0.09 
0.47 
0.09 
0.18 

1.46 
0.57 
2  93 
0.57 
1.19 

3.18 
1.23 
6.28 
1.29 
2.57 

6.92 

2.73 

13.85 

2.83 

5.65 

RB-B 

0.18 
0.11 
0.34 
0.11 
0.14 

0.89 
0.55 
1.59 
0.55 
0.66 

1.90 
1.18 
3.46 
1.25 
1.45 

4.28 
2.64 
7.88 
2.73 
3.20 

RB-T 

0.20 
0.10 
0.52 
0.10 
0.20 

1.15 
0.57 
2.99 
0.63 
1.22 

2.58 
1.24 
6.74 
1.45 
2.75 

5.79 

2.70 

15.36 

3.13 

6.08 

AVL 

0.14 
0.08 

0.08 
0.10 

0.98 
0.55 

l.OU 

0.60 
0.67 

2.00 
1.21 
3  Q9 
1.32 
1.49 

4.45 
2.63 
8  87 
2.98 
3.32 

BST 

0.15 
0.10 
0  27 
0.10 
0.10 

1.02 
0.69 
1  79 
0.71 
0.67 

2.15 
1.52 
3  88 
1.55 
1.51 

5.04 
3.43 
8  92 
3.43 
3.33 

operation 

insert 

search 

ins/del 

search 

delete 

insert 

search 

ins/del 

search 

delete 

insert 

search 

ins/del 

search 

delete 

insert 

search 

ins/del 

search 

delete 

c 

10,000 
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100,000 

1 

200,000 

Time 
{sec) 


Time  is  sum  of  time  for  parts  (b)  and  (d)  of  the  experiment 
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Figure  3.17.  Run  time  on  random  real  inputs  (version  2  code) 


Time  is  sum  of  time  for  parts  (b)  and  (d)  of  the  experiment 
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Figure  3.18.  Run  time  on  ordered  real  inputs  (version  2  code) 
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Table  3.24.  Run  time  on  ordered  real  inputs  (version  2  code) 


n 

operation 

AVL 

RB-T 

RB-B 

WB 

BEST 

DSL 

TRP 

SKIP 

iiiocr  I 

n  17 

0.23 

0.28 

0.27 

0.30 

0.23 

0.15 

0.30 

0  08 

0.08 

0.12 

0.08 

0.08 

0.12 

0.12 

0.13 

10,000 

1  n  <?  /  n  p  1 

0.23 

0.43 

0.40 

0.47 

0.60 

0.48 

0.17 

0.27 

0  08 

0.08 

0.07 

0.08 

0.08 

0.08 

0.10 

0.13 

0.08 

0.15 

0.12 

0.17 

0.20 

0.20 

0.08 

0.05 

insert 

0.83 

1.45 

1.43 

1.57 

1.37 

1.35 

0.82 

1.18 

search 

0.45 

0.48 

0.48 

0.47 

0.38 

0.60 

0.50 

0.83 

50,000 

ins/del 

1.35 

2.65 

1.95 

2.75 

2.47 

3.05 

1.05 

1.42 

search 

0.45 

0.47 

0.45 

0.47 

0.37 

0.63 

0.58 

0.77 

delete 

0.45 

1.05 

0.50 

1.00 

1.03 

1.17 

0.43 

0.33 

insert 

1.78 

2.75 

2.73 

3.43 

2.63 

3.23 

1.33 

2.18 

search 

0.97 

0.98 

1.00 

1.03 

0.77 

1.30 

1.15 

1.55 

100,000 

ins/del 

2.85 

6.22 

3.98 

6.00 

5.33 

6.37 

2.02 

3.33 

search 

0.97 

1.10 

0.98 

1.02 

0.77 

1.32 

1.03 

1.70 

delete 

0.97 

2.18 

1.05 

2.15 

2.22 

2.43 

0.63 

0.67 

insert 

3.78 

6.08 

5.43 

7.18 

5.37 

6.07 

2.87 

5.23 

se«irch 

2.08 

2.13 

2.13 

2.17 

1.63 

3.10 

2.27 

3.47 

200,000 

ins/del 

6.13 

13.93 

8.48 

13.42 

11.33 

13.60 

4.10 

7.02 

search 

2.12 

2.15 

2.13 

2.17 

1.63 

2.80 

2.18 

4.27 

delete 

2.03 

4.75 

2.27 

4.77 

4.72 

5.18 

1.35 

1.35 

Time  Unit  :  sec 
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are  done  with  much  greater  frequency  than  inserts  and/or  deletes.  Based  on  our 
experiments,  we  conclude  that  AVL  trees  remain  the  best  dictionary  structure  for 
general  applications. 

We  have  also  proposed  two  simplified  versions  of  the  BBST  called  SBBST  and 
BBSTD.  The  SBBST  seeks  only  to  provide  logarithmic  run  time  per  operation  and 
unlike  the  general  BBST,  does  not  reduce  search  cost  at  every  opportunity.  The 
SBBST  provides  slightly  better  balance  than  provided  by  WB(a)  trees.  The  BBSTD 
does  not  attempt  to  maintain  ;9-balance.  However  it  performs  rotations  to  reduce 
search  cost  whenever  possible.  Both  versions  are  very  competitive  with  BBSTs.  The 
SBBST  exhibited  much  better  run  time  performance  than  BBSTs  on  random  data 
and  the  BBSTD  slightly  outperformed  the  BBST  on  ordered  data.  However,  BBSTs 
generated  trees  with  the  lowest  search  cost  (though  not  by  much). 


CHAPTER  4 

WEIGHT  BIASED  LEFTIST  TREES  AND  MODIFIED  SKIP  LISTS 


4.1  Introduction 

Several  data  structures  (e.g.,  heaps,  leftist  trees  [9],  binomial  heaps  [10])  have 
been  proposed  for  the  representation  of  a  (single  ended)  priority  queue.  Heaps  permit 
one  to  delete  the  min  element  and  insert  an  arbitrary  element  into  an  n  element 
priority  queue  in  O(logn)  time.  Leftist  trees  support  both  these  operations  and  the 
merging  of  pairs  of  priority  queues  in  logarithmic  time.  Using  binomial  heaps,  inserts 
and  combines  take  0(1)  time  and  a  delete-min  takes  O(logn)  amortized  time.  In 
this  chapter,  we  begin  in  Section  4.2,  by  developing  the  weight  biased  leftist  tree. 
This  is  similar  to  a  leftist  tree.  However  biasing  of  left  and  right  subtrees  is  done 
by  number  of  nodes  rather  than  by  length  of  paths.  Experimental  results  presented 
in  Section  4.5  show  that  weight  biased  leftist  trees  provide  better  performance  than 
provided  by  leftist  trees.  The  experimental  comparisons  of  Section  4.5  also  include  a 
comparison  with  heaps  and  binomial  heaps  as  well  as  with  unbalanced  binary  search 
trees  and  the  probabilistic  structures  treap  [1]  and  skip  lists  [26]. 

In  Section  4.3,  we  propose  a  fixed  node  size  representation  for  skip  lists.  The 
new  structure  is  called  modified  skip  lists  and  is  experimentally  compared  with  the 
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variable  node  size  structure  skip  lists.  Our  experiments  indicate  that  modified  skip 
lists  are  faster  than  skip  lists  when  used  to  represent  dictionaries. 

Modified  skip  lists  are  augmented  by  a  thread  in  Section  4.4  to  obtain  a  structure 
suitable  for  use  as  a  priority  queue.  For  completeness,  we  include,  in  Section  4.5,  a 
comparison  of  data  structures  for  double  ended  priority  queues. 

4.2    Weight  Bia-sed  I^eftist  Trees 

Let  r  be  an  extended  binary  tree.  For  any  internal  node  x  of  T,  let  LeftChild{x) 
and  RightChild{x),  respectively,  denote  the  left  and  right  children  of  x.  The  weight, 
w{x),  of  any  node  x  is  the  number  of  internal  nodes  in  the  subtree  with  root  x. 
The  length,  shortest{x),  of  a  shortest  path  from  x  to  an  external  node  satisfies  the 


recurrence 


shortest{x)  = 


0  if  X  is  an  external  node 

1  +  min{shortest{LeftChild{x)),shortest{RightChild{x))}  otherwise. 


Definition  [9]    A  leftist  tree  (LT)  is  a  binary  tree  such  that  if  it  is  not  empty,  then 


shortest{LeftChUd{x))  >  shortest{RightChild{x)) 


for  every  internal  node  x. 

A  weight  biased  leftist  tree  (WBLT)  is  defined  by  using  the  weight  measure  in 

pl&ce  of  the  measure  shortest. 
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Definition    A  weight  biased  leftist  tree  (WBLT)  is  a  binary  tree  such  that  if  it  is 
not  empty,  then 

weight{LeftChHd{x))  >  weight{RightChild{x)) 

for  every  internal  node  x. 

It  is  known  [9]  that  the  length,  rightmost{x),  of  the  rightmost  root  to  external 
node  path  of  any  subtree,  x,  of  a  leftist  tree  satisfies 

rightmost{x)  <  log2(u'(a:)  +  1). 

The  same  is  true  for  weight  biased  leftist  trees. 

Theorem  13  Let  x  he  any  internal  node  of  a  weight  biased  leftist  tree.  rightmost{x)  < 
log2(u^(x)  +  1). 

Proof     The  proof  is  by  induction  on  w{x).   When  w{x)  =  1,  rightmost{x)  = 
1  and  Iog2(u;(i)  +  1)  =  logj  2  =  1.    For  the  induction  hypothesis,  assume  that 
rightmost{x)  <  log2(iy(i)+l)  whenever         <  n.  When  u;(i)  =  n,  w{RightChild{x))  < 
(n  -  l)/2  and  rightmost{x)  =  1  +  rightmost{RightChild{x))  <  1  +  log2((n  -  l)/2  + 
1)  =  1  +  log2(n  +  1)  -  1  =  log2(n  +  1).  □ 

Definition    A  min  (max)-WBLT  is  a  WBLT  that  is  also  a  min  (max)  tree. 

Each  node  of  a  min- WBLT  h<is  the  fields:  Isize  (number  of  internal  nodes 
in  left  subtree),  rsize,  le ft  (pointer  to  left  subtree),  right,  and  data.   While  the 
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head 


Isize 


rsize 


head 


oo    -00     0  ^ 


data 


bottom 


bottom 


(a)  Empty  min-WBLT  (b)  Nonempty  min-WBLT 

Figure  4.1.  Example  min-WBLTs 


number  of  size  fields  in  a  node  may  be  reduced  to  one,  two  fields  result  in  a  faster 
implementation.  We  assume  a  head  node  head  with  Isizt  =  oo  and  \chi\d  =  head.  In 
addition,  a  bottom  node  hoiiom  with  daia.kty  =  oo.  All  pointers  that  would  normally 
be  nil  are  replaced  by  a  pointer  to  hottam.  Figure  4.1(a)  shows  the  representation  of 
an  empty  min-WBLT  and  Figure  4.1(b)  shows  an  example  non  empty  min-WBLT. 
Notice  that  all  elements  are  in  the  right  subtree  of  the  head  node. 

Min  (max)-WBLTs  can  be  used  as  priority  queues  in  the  same  way  as  min  (max)- 
LTs.  For  instance,  a  min-WBLT  supports  the  standard  priority  queue  operations 
of  insert  and  delete-min  in  logarithmic  time.  In  addition,  the  combine  operation 
(i.e.,  join  two  priority  queues  together)  can  also  be  done  in  logarithmic  time.  The 
algorithms  for  these  operations  have  the  same  flavor  as  the  corresponding  ones  for 
min-LTs.  A  high  level  description  of  the  insert  and  delete-min  algorithm  for  min- 
WBLT  is  given  in  Figures  4.2  and  4.3,  respectively.  The  algorithm  to  combine  two 
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procedure  Insert((f)  ; 
{insert  d  into  a  min-WBLT} 
begin 

create  a  node  x  with  x.data  -  d  ; 
t  =  head  ;  {head  node} 
while  {t. right. data.key  <  d.key)  do 
begin 

t.rsize  =  t.rsize  +  1  ; 
if  {t.lsize  <  t.rsize)  then 

begin  swap  t's  children  ;  t  =  t.left  ;  end 
else  t  =  t.right  ; 
end  ; 

x.left  =  t.right  ;  x.right  =  bottom  ; 
x.lsize  =  t.rsize  ;  x.rsize  =  0  ; 
if  {t.lsize  =  t.rsize)  then  {swap  children} 
begin 

t.right  =  f./e/t  ; 

f./e/f  =  X  ;  t.lsize  =  x.lsize  +  1  ; 
end 

else 

begin  t.right  =  x  ;  f.rsize  =  t.rsize  +  1  ;  end  ; 
end  ; 

Figure  4.2.  min-WBLT  hisert 
min-WBLTs  is  similar  to  the  delete-min  algorithm.  The  time  required  to  perform 
each  of  the  operations  on  a  min-WBLT  T  is  0{rightTnost{T)). 

Notice  that  while  the  insert  and  delete-min  operations  for  min-LTs  require  a 
top-down  pass  followed  by  a  bottom-up  pass,  these  operations  can  be  performed  by 
a  single  top-down  pass  in  min-WBLTs.  Hence,  we  expect  min-WBLTs  to  outperform 
min-LTs. 

4.3    Modified  Skip  Lists 

Skip  lists  were  proposed  in  Pugh  [26]  as  a  probabilistic  solution  for  the  dictionary 
problem  (i.e.,  represent  a  set  of  keys  and  support  the  operations  of  search,  insert, 
and  delete).  The  essential  idea  in  skip  lists  is  to  maintain  upto  Imax  ordered  chains 


110 


procedure  Delete-min  ; 
begin 

X  =  head.right  ; 

if  (i  =  bottom)  then  return  ;  {empty  tree} 
head.right  =  x.left  ;  head.rsize  =  x.lsize  ; 
a  =  head; 

b  =  x.right  ;  bsize  =  x.rsize  ; 
delete  x  ; 

if  (6  =  bottom)  then  return  ; 
r  =  a.right  ; 
while  (r  7^  bottom)  do 
begin 

5  =  6st0e  +  a.rsize  ;  t  —  a.rsize  ; 
if  {a.lsize  <  s)  then  {work  on  a.left) 
begin 

a.right  =  a.left  ;  a.rsize  =  a.lsize  ;  a.lsize  =  s  ; 
if  {r.data.key  >  b.data.key)  then 

begin  a.left  =  b  ;  a  =  b;b  =  r  ;  bsize  =  t  ;  end 

else 

begin  a.left  =  r  ;  a  =  r  ;  end 

end 

else 

do  symmetric  operations  on  a.right  ; 
r  =  a.right  ; 
end  ; 

if  {a.lsize  <  bsize)  then 
begin 

a.right  =  a.left  ;  a.left  =  b  ; 
a.rsize  =  a.lsize  ;  a.lsize  =  bsize  ; 
end 

else 

begin  a.right  =  b  ;  a.rsize  =  bsize  ;  end  ; 
end  ; 

Figure  4.3.  min-WBLT  Delete-min 
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Figure  4.4.  Skip  Lists 

designated  as  level  1  chain,  level  2  chain,  etc.  If  we  currently  have  Icurrent  number 
of  chains,  then  all  n  elements  of  the  dictionary  are  in  the  level  1  chain  and  for  each 
1,2  <  I  <  Icurrent,  approximately  a  fraction  p  of  the  elements  on  the  level  /  -  1 
chain  are  also  on  the  level  /  chain.  Ideally,  if  the  level  /  -  1  chain  has  m  elements 
then  the  approximately  mxp  elements  on  the  level  /  chain  are  about  1/p  apart  in 
the  level  /  -  1  chain.  Figure  4.4  shows  an  ideal  situation  for  the  case  Icurrent  =  4 
and  p  =  1/2. 

While  the  search,  insert,  and  delete  algorithms  for  skip  lists  are  simple  and  have 
probabilistic  complexity  O(logn)  when  the  level  1  chain  has  n  elements,  skip  lists 
suffer  from  the  following  impleinentational  drawbacks: 

1.  In  programming  languages  such  as  Pascal,  it  isn't  possible  to  have  variable  size 
nodes.  As  a  result,  each  node  has  one  data  field,  and  Imax  pointer  fields.  So, 
the  n  element  nodes  have  a  total  of  n  x  Imax  pointer  fields  even  though  only 
about  n/(l  -p)  pointers  are  necessary.  Since  Imax  is  generally  much  larger  than 
3  (the  recommended  value  is  logj/p  nMax  where  nMax  is  the  largest  number  of 
elements  expected  in  the  dictionary),  skip  lists  require  more  space  than  WBLTs. 
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2.  While  languages  such  as  C  and  C++  support  variable  size  nodes  and  we  can 
construct  variable  size  nodes  using  siniulated  pointers  [27]  in  languages  such  as 
Pascal  that  do  not  support  variable  size  nodes,  the  use  of  variable  size  nodes 
requires  more  complex  storage  management  techniques  than  required  by  the 
use  of  fixed  size  nodes.  So,  greater  efficiency  can  be  achieved  using  simulated 
pointers  and  fixed  size  nodes. 

With  these  two  observations  in  mind,  we  propose  a  modified  skip  list  (MSL) 
structure  in  which  each  node  has  one  data  field  and  three  pointer  fields:  left,  right, 
and  down.  Notice  that  this  means  MSLs  use  four  fields  per  node  while  WBLTs  use 
five  (as  indicated  earlier  this  can  be  reduced  to  four  at  the  expense  of  increased  run 
time).  The  left  and  right  fields  are  used  to  maintain  each  level  /  chain  as  a  doubly 
linked  list  and  the  down  field  of  a  level  /  node  x  points  to  the  leftmost  node  in  the  level 
/  -  1  chain  that  has  key  value  larger  than  the  key  in  x.  Figure  4.5  shows  the  modified 
skip  list  that  corresponds  to  the  skip  list  of  Figure  4.4.  Notice  that  each  element  is 
in  exactly  one  doubly  linked  list.  We  can  reduce  the  number  of  pointers  in  each  node 
to  two  by  eliminating  the  field  le ft  and  having  down  point  one  node  the  left  of  where 
it  currently  points  (except  for  head  nodes  whose  down  fields  still  point  to  the  head 
node  of  the  next  chain).  However,  this  results  in  a  less  time  efficient  implementation. 
H  and  T,  respectively,  point  to  the  head  and  tail  of  the  level  Icurrent  chain. 

A  high  level  description  of  the  algorithms  to  search,  insert,  and  delete  are  given  in 
Figures  4.6,  4.7,  and  4.8.  The  next  theorem  shows  that  their  probabilistic  complexity 
is  O(logn)  where  n  is  the  total  number  of  elements  in  the  dictionary. 
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Figure  4.5.  Modified  Skip  Lists 


procedure  Search(key)  ; 

begin 

p  =  H- 

while  (p  /  nil)  do 
begin 

while  {p.data.key  <  key)  do 

p  =  p.right  ; 
if  {p.data.key  =  key)  then  report  and  stop 
else  p  -  p.left.down  ;  {1  level  down) 
end  ; 
end  ; 

Figure  4.6.  MSL  Search 
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procedure  Insert(d)  ; 
begin 

randomly  generate  the  level  k  at  which  d  is  to  be  inserted  ; 
search  the  MSL  H  for  d.key  saving  information  useful  for  insertion  ; 
if  d.key  is  found  then  fail  ;  {duplicate} 
get  a  new  node  x  and  set  x.data  =  d  ; 
if  ((Jk  >  /current)  and  (/current  ^  Imax))  then 
begin 

Icurrent  =  Icurrent  +  1  ; 

create  a  new  chain  with  a  head  node,  node  x,  and  a  tail  and 
connect  this  chain  to  H  ; 
update  H  ; 

set  x.down  to  the  appropriate  node  in  the  level  Icurrent  -  1  chain 

(to  nil  i{  k  =  I)  ; 

end 

else 

begin 

insert  x  into  the  level  k  chain  ; 

set  x.down  to  the  appropriate  node  in  the  level  k  -  I  chain 
(to  nil  if    =  1)  ; 

update  the  down  field  of  nodes  on  the  level  k  +  1  chain  (if  any)  as  needed  ; 
end  ; 
end  ; 

Figure  4.7.  MSL  Insert 


procedure  Delete(z)  ; 
begin 

search  the  MSL  H  for  a  node  x  with  data.key  =  z  saving  information  useful 

for  deletion; 

if  not  found  then  fail  ; 

let  k  be  the  level  at  which  z  is  found  ; 

for  each  node  p  on  level  k  +  1  that  has  p.down  =  x,  set  p.down  =  x.right  ; 

delete  x  from  the  level  k  list  ; 

if  the  Ust  at  level  Icurrent  becomes  empty  then 

delete  this  and  succeeding  empty  lists  until  we  reach  the  first  non  empty  list, 

update  Icurrent  ; 
end  ; 


Figure  4.8.  MSL  Delete 


115 


Theorem  U  The  probabilistic  complexity  of  the  MSL  operations  is  0([ogn). 

Proof  We  establish  this  by  showing  that  our  algorithms  do  at  most  a  logarithmic 
amount  of  additional  work  than  do  those  of  Pugh  [26].  Since  the  algorithms  of  Pugh 
[26]  has  probabilistic  O(logn)  complexity,  so  also  do  ours.  During  a  search,  the  extra 
work  results  from  moving  back  one  node  on  each  level  and  then  moving  down  one  level. 
When  this  is  done  from  any  level  other  than  Icurrent,  we  expect  to  examine  upto 
c  =  1/p- 1  additional  nodes  on  the  next  lower  level.  Hence,  upto  c{lairrent-2)  addi- 
tional nodes  get  examined.  During  an  insert,  we  also  need  to  verify  that  the  element 
being  inserted  isn't  one  of  the  elements  already  in  the  MSL.  This  requires  an  addi- 
tional comparison  at  each  level.  So,  MSLs  may  make  upto  c{lcurrent  -  2)  +  Icurrent 
additional  compares  during  an  insert.  The  number  of  down  pointers  that  need  to 
be  changed  during  an  insert  or  delete  is  expected  to  be  »P'  =  (T^p-  ^^^^  ^ 
and  p  are  constants  and  Imax  =  logi/^  n,  the  expected  additional  work  is  O(log  n).  □ 

The  relative  performance  of  skip  lists  and  modified  skip  lists  as  a  data  structure 
for  dictionaries  was  determined  by  programming  the  two  in  C.  Both  were  implemented 
using  simulated  pointers.  The  simulated  pointer  implementation  of  skip  lists  used 
fixed  size  nodes.  This  avoided  the  use  of  complex  storage  management  methods  and 
biased  the  run  time  measurements  in  favor  of  skip  lists.  For  the  case  of  skip  lists,  we 
used  p  =  1/4  and  for  MSLs,  p  =  1/5.  These  values  of  p  were  found  to  work  best  for 
each  structure.  Imax  was  set  to  16  for  both  structures. 
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We  experimented  with  n  =  10,000,  50,000,  100,000,  and  200,000.  For  each  n, 
the  following  five  part  experiment  was  conducted: 

(a)  start  with  an  empty  structure  and  perform  n  inserts; 

(b)  search  for  each  item  in  the  resulting  structure  once;  items  are  searched  for  in  the 
order  they  were  inserted 

(c)  perform  an  alternating  sequence  of  n  inserts  and  n  deletes;  in  this,  the  n  elements 
inserted  in  (a)  are  deleted  in  the  order  they  were  inserted  and  n  new  elements  are 
inserted 

(d)  search  for  each  of  the  remaining  n  elements  in  the  order  they  were  inserted 

(e)  delete  the  n  elements  in  the  order  they  were  inserted. 

For  each  n,  the  above  five  part  experiment  was  repeated  ten  times  using  different 
random  permutations  of  distinct  elements.  For  each  sequence,  we  measured  the  total 
number  of  element  comparisons  performed  and  then  averaged  these  over  the  ten 
sequences.  The  average  number  of  comparisons  for  each  of  the  five  parts  of  the 
experiment  are  given  in  Table  4.1. 

Also  given  in  this  table  is  the  number  of  comparisons  using  ordered  data.  For 
this  data  set,  elements  were  inserted  and  deleted  in  the  order  1,2,3,....  For  the 
case  of  random  data,  MSLs  make  40%  to  50%  more  comparisons  on  each  of  the  five 
parts  of  the  experiment.  On  ordered  inputs,  the  disparity  is  even  greater  with  MSLs 
making  30%  to  140%  more  comparison.  Table  4.2  gives  the  number  of  levels  in  SKIP 
and  MSL.  The  first  number  of  each  entry  is  the  number  of  levels  following  part  (a) 
of  the  experiment  and  the  second  the  number  of  levels  following  part  (b).  As  can  be 
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Table  4.1.  The  number  of  key  comparisons 


random  inputs 

ordered  inputs 

operation 

SKIP 

MbL 

CVTD 

JVlOlj 

insert 

224757 

322499 

247129 

318854 

search 

255072 

362865 

256706 

339019 

10,000 

ms/  tiei 

519430 

734161 

354566 

h»    c\c\  1  c\ 

560219 

searcn 

256124 

349591 

250538 

339121 

delete 

231745 

320594 

84392 

185489 

insert 

1357076 

1950583 

1422120 

1911818 

seairch 

1537547 

1965649 

1467217 

1836713 

50,000 

ins /del 

2996512 

4142186 

1973416 

3204400 

search 

1501731 

2038774 

1449810 

1989550 

delete 

1373858 

1853671 

486498 

931975 

insert 

2919371 

4146428 

2925618 

4275880 

search 

3188621 

4315576 

2970715 

4082193 

100,000 

ins/del 

6399463 

9103135 

4406427 

6895510 

search 

3225343 

4427979 

3277089 

4345874 

delete 

2981173 

4161994 

961283 

2052638 

insert 

6178596 

8927523 

6403207 

9022631 

search 

6697223 

9273707 

6448304 

8946474 

200,000 

ins/del 

13377747 

19370831 

9054078 

9062233 

search 

6680642 

9662006 

6458321 

9197714 

delete 

6149268 

9101721 

1995215 

4837867 

118 


Table  4.2.  Number  of  levels 


n 

random  inputs 

ordered  inputs 

SKIP 

MSL 

SKIP 

MSL 

10,000 

8,8 

7,7 

8,8 

7,7 

50,000 

9,9 

7,7 

9,9 

7,7 

100,000 

9,9 

7,8 

9,9 

7,8 

200,000 

9,9 

8,9 

9,9 

8,9 

seen,  the  number  of  levels  is  very  comparable  for  both  structures.  MSLs  generally 
had  one  or  two  levels  fewer  than  SKIPs  had. 

Despite  the  large  disparity  in  number  of  comparisons,  MSLs  generally  required 
less  time  than  required  by  SKIPs  (see  Table  4.3  and  Figure  4.9).  Integer  keys  were 
used  for  our  run  time  measurements.  In  many  practical  situations  the  observed  time 
difference  will  be  noticeably  greater  as  one  would  need  to  code  skip  lists  using  more 
complex  storage  management  techniques  to  allow  for  variable  size  nodes. 

4.4    MSLs  As  Priority  Queues 

At  first  glance,  it  might  appear  that  skip  lists  are  clearly  a  better  choice  than 
modified  skip  Usts  for  use  as  a  priority  queue.  The  min  element  in  a  skip  list  is 
the  first  element  in  the  level  one  chain.  So,  it  can  be  identified  in  0(1)  time  and 
then  deleted  in  O(logn)  probabihstic  time.  In  the  case  of  MSLs,  the  min  element 
is  the  first  one  in  one  of  the  Icurrent  chains.  This  can  be  identified  in  logarithmic 
time  using  a  loser  tree  whose  elements  are  the  first  element  from  each  MSL  chain. 
By  using  an  additional  pointer  field  in  each  node,  we  can  thread  the  elements  in  an 
MSL  into  a  chain.  The  elements  appear  in  non-decending  order  on  this  chain.  The 


Table  4.3.  Run  time 


random  inputs 

ordered  inputs 

n 

operation 

SKIP 

MSL 

SKIP 

MSL 

insert 

0.24 

0.18 

0.20 

0.17 

search 

0.18 

0.12 

0.12 

0.07 

10,000 

ins/del 

fi  45 

0.35 

0.20 

0.20 

search 

0.18 

0.12 

0.13 

0.07 

delete 

0.16 

0.12 

0.07 

0.05 

insert 

1.36 

1.22 

0.92 

0.80 

search 

1.25 

0.98 

0.62 

0.38 

50,000 

ins/del 

2.73 

2.53 

1.07 

1.08 

search 

1.16 

1.00 

0.62 

0.42 

delete 

1.10 

0.83 

0.27 

0.23 

insert 

2.84 

2.86 

1.72 

1.60 

search 

2.63 

2.39 

1.23 

0.85 

100,000 

ins/del 

6.13 

5.80 

2.43 

2.28 

search 

2.61 

2.33 

1.35 

0.92 

delete 

2.41 

2.02 

0.55 

0.52 

insert 

6.25 

6.49 

3.52 

3.47 

search 

5.85 

5.34 

2.70 

1.87 

200,000 

ins/del 

13.29 

13.02 

5.13 

4.75 

search 

5.81 

5.51 

2.72 

1.92 

delete 

5.35 

4.85 

1.12 

1.18 
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Time  is  sum  of  time  for  parts  (a)-(e)  of  the  experiment 
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Figure  4.9.  Run  time 

resulting  threaded  structure  is  referred  to  as  TMSL  (threaded  modified  skip  lists). 
A  delete  min  operation  can  be  done  in  0(1)  expected  time  when  a  TMSL  is  used. 
The  expected  time  for  an  insert  remains  O(logn).  The  algorithms  for  the  insert  and 
delete  min  operations  for  TMSLs  are  given  in  Figures  4.10  and  4.11,  respectively. 
The  last  step  of  Figure  4.10  is  implemented  by  first  finding  the  largest  element  on 
level  1  with  key  <  d.key  (for  this,  start  at  level  Icurrent  -  1)  and  then  follow  the 
threaded  chciin. 

Theorem  15  The  expected  complexity  of  an  insert  and  delete-min  operation  in  a 
TMSL  is  O(\ogn)  and  0(1),  respectively. 


Proof    Follows  from  the  notion  of  a  thread,  Theorem  14,  and  Pugh  [26]. 


□ 
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procedure  Insert(d)  ; 
begin 

randomly  generate  the  level  k  at  which  d  is  to  be  inserted  ; 
get  a  new  node  x  and  set  x.data  =  d  ; 
if  {{k  >  Icurrent)  and  {Icurrent  ^  Imax))  then 
begin 

Icurrent  =  Icurrent  +  1  ; 

create  a  new  chain  with  a  head  node,  node  x,  and  a  tail  and 
connect  this  chain  to  H  ; 
update  H  ; 

set  x.down  to  the  appropriate  node  in  the  level  Icurrent  —  1  chain 

(to  nil  if  A;  =  1)  ; 

end 

else 

begin 

insert  x  into  the  level  k  chain  ; 

set  x.down  to  the  appropriate  node  in  the  level  k  —  \  chain  (to  nil  if  ^  =  1)  ; 
update  the  down  field  of  nodes  on  the  level  k  ■\-\  chain  (if  any)  as  needed  ; 
end  ; 

find  node  with  largest  key  <  d.key  and  insert  x  into  threaded  list  ; 
end  ; 

Figure  4.10.  TMSL  Insert 


procedure  Delete-min  ; 
begin 

delete  the  first  node  i  from  the  thread  list  ; 
let  k  be  the  level  x  is  on  ; 

delete  x  from  the  level  k  list  (note  there  cire  no  down  fields  on  level  ^:  +  1 
that  need  to  be  updated)  ; 

if  the  list  at  level  Icurrent  becomes  empty  then 

delete  this  and  succeeding  empty  lists  until  we  reach  the  first  non  empty  list, 

update  Icurrent  ; 
end  ; 


Figure  4.11.  TMSL  Delete-min 
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procedure  Delete-max  ; 
begin 

delete  the  last  node  x  from  the  thread  list  ; 
let  k  be  the  level  x  is  on  ; 

delete  x  from  the  level  k  list  updating  p.down  for  nodes  on  level  A:  + 1  as  necessary; 
if  the  list  at  level  Icurrent  becomes  empty  then 

delete  this  and  succeeding  empty  lists  until  we  reach  the  first  non  empty  list, 

update  Icurrent  ; 
end  ; 

Figure  4.12.  TMSL  Delete-max 
TMSLs  may  be  further  extended  by  making  the  threaded  chain  a  doubly  Unked 
list.  This  permits  both  delete-min  and  delete-max  to  be  done  in  0(1)  expected  time 
and  insert  in  O(logn)  expected  time.  With  this  extension,  TMSLs  may  be  used  to 
represent  double  ended  priority  queues. 

4  5    Experimental  Results  For  Priority  Queues 

The  single-ended  priority  queue  structures  min  heap  (Heap),  binomial  heap 
(B-Heap),  leftist  trees  (LT),  weight  biased  leftist  trees  (WBLT),  and  TMSLs  were 
programmed  in  C.  In  addition,  priority  queue  versions  of  unbalanced  binary  search 
trees  (EST),  AVL  trees,  treaps  (TRP),  and  skip  lists  (SKIP)  were  also  programmed. 
The  priority  queue  version  of  these  structures  differed  from  their  normal  dictionary 
versions  in  that  the  delete  operation  was  customized  to  support  only  a  delete  min. 
For  skip  lists  and  TMSLs,  the  level  allocation  probability  p  was  set  to  1/4.  While 
BSTs  are  normally  defined  only  for  the  case  when  the  keys  are  distinct,  they  are 
easily  extended  to  handle  multiple  elements  with  the  same  key.  In  our  extension, 
if  a  node  has  key  x,  then  its  left  subtree  has  values  <  x  and  its  right  values  >  x. 
To  minimize  the  effects  of  system  call  overheads,  all  structures  (other  than  Heap) 
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were  programmed  using  simulated  pointers.  The  min  heap  was  programmed  using  a 

one-dimensionjd  array. 

For  our  experiments,  we  began  with  structures  initialized  with  n  =  100,  1,000, 
100,000,  and  100,000  elements  and  then  performed  a  random  sequence  of  100,000 
operations.  This  random  sequence  consists  of  approximately  50%  insert  and  50% 
delete  min  operations.  The  results  are  given  in  Tables  4.4,  4.5,  and  4.6.  In  the  data 
sets  'randoml'  and  'random2',  the  elements  to  be  inserted  were  randomly  generated 
while  in  the  data  set  'increasing'  an  ascending  sequence  of  elements  was  inserted 
and  in  the  data  set  'decreasing',  a  descending  sequence  of  elements  was  used.  Since 
BST  have  very  poor  performance  on  the  last  two  data  sets,  we  excluded  it  from  this 
part  of  the  experiment.  In  the  case  of  both  randoml  and  random2,  ten  random 
sequences  were  used  and  the  average  of  these  ten  is  reported.  The  randoml  and 
random2  sequences  differed  in  that  for  randoml,  the  keys  were  integers  in  the  range 
0.. (10^-1)  while  for  random2,  they  were  in  the  range  0..999.  So,  random2  is  expected 
to  have  many  more  duplicates. 

Table  4.4  gives  the  total  number  of  comparisons  made  by  each  of  the  methods. 
On  the  two  random  data  tests,  weight  biased  leftist  trees  required  the  fewest  number 
of  comparisons  except  when  n  =  100, 000.  In  this  case,  AVL  trees  required  the  fewest. 
With  ascending  data,  treaps  did  best  and  with  descending  data,  LTs  and  WBLTs  did 
best.  For  both,  each  insert  could  be  done  with  one  comparison  as  both  structures 
build  a  left  skewed  tree. 
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CO 

a 
o 

to 
'u 

a 

o 

u 
>> 


(-1 

s 

V 

H 


AVL 

373683 
542789 
685422 
877131 

358885 
534406 
685926 
8Q01Q0 

382513 
584422 
747581 
902437 

334965 
512942 
672910 
835044 

TMSL 

503567 
729274 
1104805 
1778579 

485995 
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The  structure  height  initially  and  following  the  100,000  operations  is  given  in 
Table  4.5  for  BSTs,  Heaps,  TRPs  and  AVL  trees.  For  B-Heaps,  the  height  of  the 
tallest  tree  is  given.  For  SKIPs  and  TMSLs,  this  table  gives  the  number  of  levels.  In 
the  case  of  LT  and  WBLT,  this  table  gives  the  length  of  the  rightmost  path  following 
initialization  and  the  average  of  its  length  following  each  of  the  100,000  operations. 
The  two  leftist  structures  are  able  to  maintain  their  rightmost  paths  so  as  to  have  a 
length  much  less  than  log2(n  +  1). 

The  measured  run  times  on  a  Sun  Sparc  5  are  given  in  Table  4.6.  For  this,  the 
codes  were  compiled  using  the  cc  compiler  in  optimized  mode.  The  run  time  for  the 
data  set  randoml  is  graphed  in  Figure  4.13.  The  run  time  for  the  data  set  random2 
and  Heap,  LT,  WBLT,  SKIP,  TMSL,  and  AVL  is  graphed  in  Figure  4.14.  For  the 
data  sets  randoml  and  random2  with  n  =  100  and  1,000,  WBLTs  required  least  time. 
For  randoml  with  n  =  10,000,  BSTs  took  least  time  while  when  n  =  100,000,  both 
BSTs  and  Heaps  took  least  time.  For  random2  with  n  =  10,000,  WBLTs  were  fastest 
while  for  n  =  100, 000,  Heap  was  best.  On  the  ordered  data  sets,  BSTs  have  a  very 
high  complexity  and  are  the  poorest  performers  (times  not  shown  in  Table  4.6).  For 
increasing  data,  Heap  was  best  for  n  =  100,  1,000  and  10,000  and  both  Heap  and 
TRP  best  for  n  =  100,000.  For  decreasing  data,  WBLTs  were  generally  best.  On 
all  data  sets,  WBLTs  always  did  at  least  as  well  (and  often  better)  as  LTs.  Between 
SKIP  and  TMSL,  we  see  that  SKIP  generally  did  better  for  small  n  and  TMSL  for 
Icirge  n. 
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Figure  4.13.  Run  time  on  randoml 


Another  way  to  interpret  the  time  results  is  in  terms  of  the  ratio  m/n  (m  = 
number  of  operations).  In  the  experiments  reported  in  Table  4.6,  m  =  100,000.  As 
m/n  increases,  WBLTs  and  LTs  perform  better  relative  to  the  remaining  structures. 
This  is  because  as  m  increases,  the  (weight  biased)  leftist  trees  constructed  are  very 
highly  skewed  to  the  left  and  the  length  of  the  rightmost  path  is  close  to  one. 

Tables  4.7,  4.8,  and  4.9  provide  an  experimental  comparison  of  BSTs,  AVL 
trees,  MMHs  (min-max  heaps)  [2],  Deaps  [7],  TRPs,  SKIPs,  and  TMSLs  as  a  data 
structure  for  double  ended  priority  queues.  The  experimental  setup  is  similar  to  that 
used  for  single  ended  priority  queues.  However,  this  time  the  operation  mix  was  50% 
insert,  25%  delete-min,  and  25%  delete-max.  On  the  comparison  measure,  treaps 
did  best  on  increasing  data  (except  when  n  =  100)  and  skip  lists  did  best  when 
decreasing  data  was  used.  On  all  other  data,  AVL  trees  did  best.  As  far  as  run  time 
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Figure  4.14.  Run  time  on  random2 

is  concerned,  BSTs  did  best  on  the  random  data  tests  except  when  n  =  100,000  and 
the  set  random2  was  used.  In  this  case,  deaps  and  AVL  trees  took  least  time.  For 
increasing  data,  treaps  were  best  and  for  decreasing  data,  skip  lists  were  best.  The 
run  time  for  the  data  set  randoml  is  graphed  in  Figure  4.15.  The  run  time  for  the 
data  set  random2  and  MMH,  Deap,  SKIP,  TMSL,  and  AVL  is  graphed  in  Figure  4.16. 

4.6  Conclusion 

We  have  developed  two  new  data  structures:  weight  biased  leftist  trees  and 
modified  skip  lists.  Experiments  indicate  that  WBLTs  have  better  performance  (i.e., 
time  characteristic  and  number  of  comparisons)  than  LTs  as  a  data  structure  for 
gle  ended  priority  queues  and  MSLs  have  a  better  performance  than  skip  lists  as 
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Table  4.8.  Height/level  of  the  structures 
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Figure  4.15.  Run  time  on  raindoml 
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Table  4.9.  Run  time  using  integer  keys 
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Figure  4.16.  Run  time  on  random2 


a  data  structure  for  dictionaries.  MSLs  have  the  added  advantage  of  using  fixed  size 
nodes. 

Our  experiments  show  that  binary  search  trees  (modified  to  handle  equal  keys) 
perform  best  of  the  tested  double  ended  priority  queue  structures  using  random  data. 
Of  course,  these  are  unsuitable  for  general  application  as  they  have  very  poor  per- 
formance on  ordered  data.  Min-max  heaps,  deaps  and  AVL  trees  guarantee  O(logn) 
behavior  per  operation.  Of  these  three,  AVL  trees  generally  do  best  for  large  n.  It 
is  possible  that  other  balanced  search  structures  such  as  bottom-up  red-black  trees 
might  do  even  better.  Treaps  and  skip  lists  are  randomized  structures  with  O(logn) 
expected  complexity.  Treaps  were  generally  faster  than  skip  lists  (except  for  decreas- 
ing data)  as  double  ended  priority  queues. 
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For  single  ended  priority  queues,  if  we  exclude  BSTs  because  of  their  very  poor 
performance  on  ordered  data,  WBLTs  did  best  on  the  data  sets  randoml  and  random2 
(except  when  n  =  100,000),  and  decreasing.  Heaps  did  best  on  the  remaining  data 
sets.  The  probabilistic  structures  TRP,  SKIP  and  TMSL  were  generaUy  slower  than 
WBLTs.  When  the  ratio  m/n  (m  =  number  of  operations,  n  =  average  queue  size) 
is  large,  WBLTs  (and  LTs)  outperform  heaps  (and  all  other  tested  structures)  as  the 
binary  trees  constructed  tend  to  be  highly  skewed  to  the  left  and  the  length  of  the 
rightmost  path  is  close  to  one. 

Our  experimental  results  for  single  ended  priority  queues  are  in  marked  con- 
trast to  those  reported  in  Gonnet  and  Baeza- Yates  [11,  p228]  where  leftist  trees  are 
reported  to  take  approximately  four  times  as  much  time  as  heaps.  We  suspect  this 
difference  in  results  is  because  of  different  programming  techniques  (recursion  vs.  it- 
eration, dynamic  vs.  static  memory  allocation,  etc.)  used  in  Gonnet  and  Baeza- Yates 
[11]  for  the  different  structures.  In  our  experiments,  all  structures  were  coded  using 
similar  programming  techniques. 


CHAPTER  5 
CONCLUSIONS 

We  have  considered  the  problem  of  joining  a  row  of  compacted  cells  and  de- 
veloped heuristics  to  stretch  cells  and  river-route  the  nets  so  that  the  layout  area  is 
minimized.  Our  proposed  heuristics  were  compared,  experimentally,  with  Fang  [8] 
and  found  to  produce  layouts  with  less  area. 

We  have  developed  a  new  weight  balanced  search  structure  called  ^-BBST.  Our 
experimental  results  show  that  BBSTs  generally  have  the  best  search  cost  of  the 
structures  considered.  We  reconmiend  y9-BBSTs  for  environments  where  searches 
are  done  with  much  greater  frequency  than  inserts  and/or  deletes.  Based  on  our 
experiments,  we  conclude  that  AVL  trees  remain  the  best  dictionary  structure  for 
general  applications. 

We  have  also  proposed  two  simplified  versions  of  the  BBST  called  SBBST  and 
BBSTD.  Both  versions  are  very  competitive  with  BBSTs.  The  BBST  and  SBBST 
provide  slightly  better  balance  than  provided  by  WB(q)  trees. 

Another  new  data  structures,  weight  biased  leftist  trees,  developed  by  us,  has 
better  performance  than  leftist  trees  as  a  data  structure  for  single  ended  priority 
queues.  We  have  also  proposed  a  fixed  node  size  representation  for  skip  lists.  As 
a  data  structure  for  dictionaries,  this  has  a  better  performance  than  skip  hsts  with 
variable  node  size. 
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Table  A.l.  Abbreviations  used  in  tables 


AVL 

Adelson-Velskii  <ind  Landis'  trees 

B-Heap 

binomial  heaps 

BBST 

yS-balanced  binary  search  trees 

BBSTD 

BBSTs  without  deletion 

BST 

unbalanced  binary  search  trees 

Deap 

deaps 

DSL 

deterministic  skip  lists 

Heap 

heaps 

LT 

leftist  trees 

MMH 

min-m<ix  heaps 

MSL 

modified  skip  lists 

RB-B 

bottom-up  red-black  trees 

RB-T 

top-down  red- black  trees 

SBBST 

simple  /3-balanced  binary  search  trees 

SKIP 

skip  lists 

TMSL 

threaded  modified  skip  lists 

TRP 

treaps 

WB 

weight  balanced  trees 

WBLT 

weight  biased  leftist  trees 
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